w_title(html2stx)dnl
w_doc_id(h2sman)dnl
w_section(1)dnl
w_author(Panu A. Kalliokoski)dnl
w_man_desc(convert HTML documents into Stx)
! SYNOPSIS
''html2stx'' [ /file/ ]
! DESCRIPTION
''html2stx'' takes the given /file/, which should contain an HTML
document, and converts it to structured text (Stx). If no file is
given, standard input is read instead.
The program does not attempt to convert every possibly convertible piece
of markup into Stx. For example, w_lt`'font`'w_gt tags are simply
ignored. This tends to result in a nice, clean, beautiful document.
(If it doesn't, the source document probably does not contain enough
information to start with.)
! OPTIONS
None.
! DIAGNOSTICS
''html2stx'' is a python script and will throw an exception if something
goes amiss. In this case, the return value will be non-zero.
! SEE ALSO
''stx2any'' (1), _Stx markup reference_ (''PREFIX/share/doc/stx2any/examples/Stx-ref.txt'')
! BUGS
- The word wrapping algorithm is probably not very clever.
- Sometimes there are extra linebreaks in the output.
- Probably many others.
! AUTHOR
This manual page was written by w_author.
''html2stx'' is derived from the ''html2text'' utility by Aaron Swartz.
''html2text'' is a utility for converting html into "Markdown"
structured text; the changes required to make it work for Stx were done
by Panu Kalliokoski.