Re: Looking for a HTML --> text utility

---------

era eriksson (reriksso@cc.helsinki.fi)
Thu, 20 Jun 1996 20:10:38 +0300


On Thu, 20 Jun 1996 11:50:19 -0400 (EDT),
"Andrew D. Taylor" <af883@freenet.carleton.ca> wrote:
> Earlier, Michael C. Taylor (most excellent last name) asked:
>> On Wed, 19 Jun 1996, Al Gilman wrote:
>>> Have you played with "lynx -dump"?
>> Is there a noticeable difference between lynx -dump and using print to a
>> local file?
> Yes. lynx -dump isn't as nice IMO. A print to local file prints the lynx
> rendering. A lynx -dump footnotes each link by adding an ugly [12] where
> the anchor command is and listing the URLs at the end.

This was an, er, improvement over older versions. If you can find an
older Lynx, save it in a safe place. (Sorry, I have no detailed idea
which version numbers are involved. The people who actually +use+ Lynx
can probably tell you that ;^)

Of course, you can always post-process the output from Lynx, but the
older output format was certainly easier to deal with. (I'll sketch it
here, though: replace every match of /\[[0-9]+\]/ with nothing, and
chop off the trailing run of consecutive lines that lists the URLs.
Easy enough if you know Perl.)
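
For anyone who would rather see that post-processing spelled out, here
is a rough sketch in Python rather than Perl. It makes some assumptions
that are not in the message above: that each entry in the trailing URL
list looks like "  12. scheme:..." on a line of its own, and that the
list may be preceded by a heading line reading "References". The name
strip_lynx_footnotes is made up for this example. Check it against the
output of your own Lynx version before relying on it.

```python
import re

# Inline link markers such as [12], as described in the message above.
LINK_MARKER = re.compile(r"\[[0-9]+\]")

# ASSUMPTION: an entry in the trailing list looks like "  12. http://...".
URL_ENTRY = re.compile(r"^\s*[0-9]+\.\s+[A-Za-z][A-Za-z0-9+.-]*:\S*$")


def strip_lynx_footnotes(dump):
    """Remove [n] markers and the trailing URL list from lynx -dump text."""
    lines = dump.splitlines()
    end = len(lines)
    saw_entry = False

    # Walk backwards over the trailing run of blank lines and URL entries.
    while end > 0:
        line = lines[end - 1]
        if URL_ENTRY.match(line):
            saw_entry = True
        elif line.strip():
            break
        end -= 1

    # ASSUMPTION: the list may carry a "References" heading; drop it only
    # if a URL list was actually found below it.
    if saw_entry and end > 0 and lines[end - 1].strip() == "References":
        end -= 1

    # Delete the inline markers and tidy trailing whitespace.
    cleaned = [LINK_MARKER.sub("", line).rstrip() for line in lines[:end]]
    while cleaned and not cleaned[-1]:
        cleaned.pop()
    return "\n".join(cleaned) + "\n"
```

Note that the marker pattern will also eat any genuine "[12]" that
appears in the page text itself, which is one more reason the older
output format was easier to deal with.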

The nice thing about lynx -dump is, above all, that you can run it
unattended, say in a biweekly cron job or from a makefile. You could
always try to fake this by running something like Expect around Lynx
to feed it the necessary keystrokes, and stay with the
fetch-and-print-to-local-file procedure.
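
To illustrate the unattended case, here is a minimal sketch of a Python
wrapper that a cron job or makefile rule could call. It assumes a lynx
binary is on the PATH; the function names, and whatever URL and output
path you pass in, are placeholders of mine, not anything from the
thread. The only Lynx feature it relies on is the -dump option
discussed above.

```python
import subprocess


def build_dump_command(url):
    """Return the argument list for a non-interactive Lynx dump of url."""
    return ["lynx", "-dump", url]


def dump_page(url, out_path):
    """Run lynx -dump on url and write the rendered text to out_path.

    ASSUMPTION: lynx is installed and reachable via PATH. A non-zero
    exit status raises subprocess.CalledProcessError, so a failing
    fetch shows up in the cron or make output instead of passing silently.
    """
    result = subprocess.run(
        build_dump_command(url),
        capture_output=True,
        text=True,
        check=True,
    )
    with open(out_path, "w") as handle:
        handle.write(result.stdout)
```

A call such as dump_page("http://example.com/", "page.txt") needs no
keystrokes at all, which is exactly what the print-to-local-file route
cannot offer without something like Expect.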

Just some random thoughts,

/* era */

PS. The last time this was discussed, Vic Metcalfe sounded like he was
thinking about writing SGML authoring tools to implement all of this
in a more structured and extensible way. I guess I did too, passively.
Whatever happened to that? I'm still interested in contributing if I
can, only it'll have to wait until +when+ I can :-)

-- 
See <http://www.ling.helsinki.fi/~reriksso/> for mantra, disclaimer, etc.


---------

faq-admin@landfield.com

© Copyright The Landfield Group, 1997
All rights reserved