python-html2text (1) Linux Manual Page
NAME
python-html2text – manual page for python-html2text 2016.9.19
SYNOPSIS
python-html2text [(filename|url) [encoding]]
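The synopsis can also be driven from a script. The following is a minimal sketch, assuming the executable is installed on the PATH under the name used on this page, python-html2text. The helper build_command and the input name page.html are hypothetical and exist only for illustration.

```python
import os
import shlex
import shutil
import subprocess


def build_command(source, encoding=None, options=()):
    """Build an argv list following the synopsis: [(filename|url) [encoding]].

    build_command is a hypothetical helper, not part of html2text.
    """
    argv = ["python-html2text", *options, source]
    if encoding is not None:
        # The encoding is positional and only meaningful after a filename or URL.
        argv.append(encoding)
    return argv


argv = build_command("page.html", "utf-8",
                     options=["--body-width=0", "--ignore-images"])
print(shlex.join(argv))
# prints: python-html2text --body-width=0 --ignore-images page.html utf-8

# Run the command only if the executable and the (hypothetical) input file exist.
if shutil.which(argv[0]) and os.path.exists("page.html"):
    result = subprocess.run(argv, capture_output=True, text=True, check=False)
    print(result.stdout)
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the filename or URL contains spaces or shell metacharacters.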
OPTIONS
--version- show program’s version number and exit
-h, --help- show this help message and exit
--default-image-alt=DEFAULT_IMAGE_ALT- The default alt string for images with missing ones
--pad-tables- pad the cells to equal column width in tables
--no-wrap-links- do not wrap links during conversion
--ignore-emphasis- don’t include any formatting for emphasis
--reference-links- use reference style links instead of inline links
--ignore-links- don’t include any formatting for links
--protect-links- protect links from line breaks by surrounding them with angle brackets
--ignore-images- don’t include any formatting for images
--images-to-alt- Discard image data, only keep alt text
--images-with-size- Write image tags with height and width attrs as raw html to retain dimensions
-g, --google-doc- convert an html-exported Google Document
-d, --dash-unordered-list- use a dash rather than a star for unordered list items
-e, --asterisk-emphasis- use an asterisk rather than an underscore for emphasized text
-b BODY_WIDTH, --body-width=BODY_WIDTH- number of characters per output line, 0 for no wrap
-i LIST_INDENT, --google-list-indent=LIST_INDENT- number of pixels Google indents nested lists
-s, --hide-strikethrough- hide strike-through text. Only relevant when -g is specified as well
--escape-all- Escape all special characters. Output is less readable, but avoids corner case formatting issues.
--bypass-tables- Format tables in HTML rather than Markdown syntax.
--ignore-tables- Ignore table-related tags (table, th, td, tr) while keeping rows.
--single-line-break- Use a single line break after a block element rather than two line breaks. NOTE: Requires --body-width=0
--unicode-snob- Use unicode throughout document
--no-automatic-links- Do not use automatic links wherever applicable
--no-skip-internal-links- Do not skip internal links
--links-after-para- Put links after each paragraph instead of at the end of the document
--mark-code- Mark program code blocks with [code]…[/code]
--decode-errors=DECODE_ERRORS- What to do in case of decode errors. 'ignore', 'strict' and 'replace' are acceptable values
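EXAMPLES
The values accepted by --decode-errors have the same names as Python's standard codec error handlers. The sketch below shows what each handler does when the input bytes do not match the declared encoding. It uses only bytes.decode from the standard library and does not call html2text itself; that the tool hands the value straight to the decoder is an assumption, not something this page states.

```python
# Bytes that are valid Latin-1 but not valid UTF-8 (0xE9 is "é" in Latin-1).
raw = b"<p>caf\xe9</p>"

# 'ignore' silently drops the undecodable byte.
ignored = raw.decode("utf-8", errors="ignore")

# 'replace' substitutes U+FFFD, the Unicode replacement character.
replaced = raw.decode("utf-8", errors="replace")

# 'strict' raises UnicodeDecodeError instead of producing any output.
try:
    raw.decode("utf-8", errors="strict")
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True

print(repr(ignored))
print(repr(replaced))
print(strict_failed)
```

Under that assumption, 'strict' suits pipelines that should stop on bad input, while 'replace' keeps going but leaves a visible marker where data was lost.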
