
Minor fix for sb-heise.el



Hi,

sb-heise.el isn't properly extracting the message body in some recent
articles (example: http://www.heise.de/newsticker/meldung/51214). The
attached patch adds the <HEISETEXT> marker to the three regular
expressions that locate the start of the article text.
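
For illustration only, here is a minimal sketch of how the extended start
pattern behaves. It is not part of the patch: the function name
`my-heise-find-content-start' and the sample strings are hypothetical, and
the pattern is copied from the patched `shimbun-heise-content-start'.

```elisp
;; Hypothetical helper, not part of sb-heise.el.
(defun my-heise-find-content-start (html)
  "Return the index in HTML where an article start marker begins, or nil."
  (let ((case-fold-search nil)          ; match the markers case-sensitively
        ;; Same alternatives as the patched `shimbun-heise-content-start'.
        (content-start
         "\\(<!-- Meldung -->\\|<!-- INHALT -->\\|<HEISETEXT>\\)"))
    ;; `string-match' returns the start index of the first match, or nil.
    (string-match content-start html)))

;; An old-style page and a new-style page both yield a start position;
;; a page with none of the markers yields nil.
(my-heise-find-content-start "<!-- Meldung --><p>Text</p>")
(my-heise-find-content-start "<html><body><HEISETEXT><p>Text</p>")
(my-heise-find-content-start "<p>no marker</p>")
```

Without the <HEISETEXT> alternative the second call would return nil, which
would be consistent with the missing body in the example article.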

Thanks
Andreas
Index: sb-heise.el
===================================================================
RCS file: /storage/cvsroot/emacs-w3m/shimbun/sb-heise.el,v
retrieving revision 1.4
diff -c -r1.4 sb-heise.el
*** sb-heise.el	8 May 2004 00:39:42 -0000	1.4
--- sb-heise.el	18 Sep 2004 10:39:31 -0000
***************
*** 38,44 ****
  
  
  (defvar shimbun-heise-content-start
!   "\\(<!-- Meldung -->\\|<!-- INHALT -->\\)")
  (defvar shimbun-heise-content-end
    "\\(<!-- untere News-Navigation -->\\|<!-- INHALT -->\\)")
  
--- 38,44 ----
  
  
  (defvar shimbun-heise-content-start
!   "\\(<!-- Meldung -->\\|<!-- INHALT -->\\|<HEISETEXT>\\)")
  (defvar shimbun-heise-content-end
    "\\(<!-- untere News-Navigation -->\\|<!-- INHALT -->\\)")
  
***************
*** 134,140 ****
  	      "+0000"))))))
  
      ;; get the real from
!     (let ((regexp-from-begin "<!-- Meldung -->")
  	  (regexp-from-end "<!-- untere News-Navigation -->")
  	  (regexp-from (concat "(<a href=\"mailto:\\([^@]+@ct.heise.de\\)\""
  			       "[^>]*>\\([^<]+\\)</a>"))
--- 134,140 ----
  	      "+0000"))))))
  
      ;; get the real from
!     (let ((regexp-from-begin "<!-- Meldung -->\\|<HEISETEXT>")
  	  (regexp-from-end "<!-- untere News-Navigation -->")
  	  (regexp-from (concat "(<a href=\"mailto:\\([^@]+@ct.heise.de\\)\""
  			       "[^>]*>\\([^<]+\\)</a>"))
***************
*** 154,160 ****
  
      ;; strip ads
      (goto-char (point-min))
!     (let ((regexp-ad-begin "<!-- Meldung -->")
  	  (regexp-ad-end "<!-- untere News-Navigation -->")
  	  (regexp-ad "<!--OAS AD=\"Middle[0-9]*\"-->")
  	  (tmp-point) (bound-min) (bound-max))
--- 154,160 ----
  
      ;; strip ads
      (goto-char (point-min))
!     (let ((regexp-ad-begin "<!-- Meldung -->\\|<HEISETEXT>")
  	  (regexp-ad-end "<!-- untere News-Navigation -->")
  	  (regexp-ad "<!--OAS AD=\"Middle[0-9]*\"-->")
  	  (tmp-point) (bound-min) (bound-max))