[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Patch to emacs-w3m

From: Paul Kinnucan <paulk@xxxxxxxxxxxxx>
Date: Thu, 16 Oct 2003 13:12:33 -0400
X-ml-name: emacs-w3m
X-mail-count: 05960
References: <16260.65445.650000.941492@gargle.gargle.HOWL><16262.39042.951551.144968@jpl.org><16266.13548.810000.67568@gargle.gargle.HOWL><yotlzng5hbgr.fsf@jpl.org><16267.35978.630000.418070@gargle.gargle.HOWL><b9y3cdwibuw.fsf@jpl.org><b9yfzhwgqr5.fsf@jpl.org><86r81fu672.fsf@turing.ccd.uab.es><16268.12354.603000.991466@gargle.gargle.HOWL><86ptgz4bzn.fsf@turing.ccd.uab.es><b9ybrsj5o6b.fsf@jpl.org><16268.54841.800000.806206@gargle.gargle.HOWL><b9y4qyb3sem.fsf@jpl.org>

Katsumi Yamaoka writes:
 > >>>>> In [emacs-w3m : No.05922]
 > >>>>>	Paul Kinnucan <paulk@mathworks.com> wrote:
 > 
 > > As promised, here is a version of w3m-decode-entity-string
 > > that handles strings with multiple embedded entities without
 > > creating a temporary buffer.
 > 
 > > (defun w3m-decode-entity-string (encoded-str)
 > >   "Decode entities in the string STR."
 > 
 > Thanks for the new code.  However, TSUCHIYA Masatoshi also
 > brought up a similar code yesterday in the emacs-w3m mailing
 > list.  It will probably be more efficient since there are less
 > concat's than yours.  Here it is:
 > 
 > (defun w3m-decode-entities-string (str)
 >   "Decode entities in the string STR."
 >   (save-match-data
 >     (let ((pos 0) (buf))
 >       (while (string-match w3m-entity-regexp str pos)
 > 	(setq buf (cons (or (w3m-entity-value (match-string 1 str)
 > 					      (match-beginning 2))
 > 			    (match-string 0 str))
 > 			(cons (substring str pos (match-beginning 0))
 > 			      buf))
 > 	      pos (match-end 0)))
 >       (if buf
 > 	  (apply 'concat
 > 		 (nreverse (if (< pos (length str))
 > 			       (cons (substring str pos) buf)
 > 			     buf)))
 > 	str))))
 > 

This version is much faster than mine and hence is preferable.

 > Anyway, we aren't immediately going to implement the new code
 > now.  Since there are some bugs (see below) and we should fix
 > them, putting similar codes on two places (w3m-decode-entities
 > and w3m-decode-entities-string) may confuse the development.
 > 

I would suggest renaming w3m-decode-entities as 
w3m-decode-entities-buffer to emphasize that it operates on
buffers. This would also avoid confusion with 
w3m-decode-entities-string.

 > (w3m-decode-entities-string "&ltx;")
 >  => "&ltx;"
 
Why is this a bug? If the function cannot decode an entity, shouldn't
it simply return the entity?

Again, thanks for all your work.

Paul

References:
- Patch to support emacs-w3m
  - From: Katsumi Yamaoka
- Re: Patch to emacs-w3m
  - From: Katsumi Yamaoka
- Re: Patch to emacs-w3m
  - From: Paul Kinnucan
- Re: Patch to emacs-w3m
  - From: Katsumi Yamaoka
- Re: Patch to emacs-w3m
  - From: Katsumi Yamaoka
- Re: Patch to emacs-w3m
  - From: Jose A. Ortega Ruiz
- Re: Patch to emacs-w3m
  - From: Paul Kinnucan
- Re: Patch to emacs-w3m
  - From: Jose A. Ortega Ruiz
- Re: Patch to emacs-w3m
  - From: Katsumi Yamaoka
- Re: Patch to emacs-w3m
  - From: Paul Kinnucan
- Re: Patch to emacs-w3m
  - From: Katsumi Yamaoka

Prev by Date: Re: need force-header-line-update
Next by Date: Re: sb-zdnet エラー
Previous by thread: Re: Patch to emacs-w3m
Next by thread: Re: Patch to support emacs-w3m
Index(es):
- Date
- Thread

Namazu Search: [Help]