html2text-2.7 (1) - Linux Manuals

html2text-2.7: manual page for html2text-2.7 2016.9.19


html2text-2.7 - manual page for html2text-2.7 2016.9.19


html2text-2.7 [(filename|url) [encoding]]


show program's version number and exit
-h, --help
show this help message and exit
The default alt string for images with missing ones
pad the cells to equal column width in tables
wrap links during conversion
don't include any formatting for emphasis
use reference style links instead of inline links
don't include any formatting for links
protect links from line breaks surrounding them with angle brackets
don't include any formatting for images
Discard image data, only keep alt text
Write image tags with height and width attrs as raw html to retain dimensions
-g, --google-doc
convert an html-exported Google Document
-d, --dash-unordered-list
use a dash rather than a star for unordered list items
-e, --asterisk-emphasis
use an asterisk rather than an underscore for emphasized text
-b BODY_WIDTH, --body-width=BODY_WIDTH
number of characters per output line, 0 for no wrap
-i LIST_INDENT, --google-list-indent=LIST_INDENT
number of pixels Google indents nested lists
-s, --hide-strikethrough
hide strike-through text. only relevant when -g is specified as well
Escape all special characters. Output is less readable, but avoids corner case formatting issues.
Format tables in HTML rather than Markdown syntax.
Ignore table-related tags (table, th, td, tr) while keeping rows.
Use a single line break after a block element rather than two line breaks. NOTE: Requires --body-width=0
Use unicode throughout document
Do not use automatic links wherever applicable
Do not skip internal links
Put links after each paragraph instead of document
Mark program code blocks with [code]...[/code]
What to do in case of decode errors.'ignore', 'strict' and 'replace' are acceptable values