htmltextconvert 0.1.2
htmltextconvert renders HTML to plain text, for example to autogenerate plain
text versions of HTML emails or to index HTML documents for search.
It differs from other packages in these ways:
Pure Python, no dependencies
High-quality, well-tested code
Permissive license (Apache)
Renders the HTML to text suitable for a text/plain email body (it doesn’t
aim to convert to a structured text format like Markdown, but rather to give
a readable text-only representation of the rendered HTML).
Usage:
>>> import htmltextconvert
>>> print(
...     htmltextconvert.html_to_text(
...         """
...         <p>This is a paragraph.</p>
...         <p>This is another paragraph.</p>
...         """
...     )
... )
This is a paragraph.
This is another paragraph.
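For the HTML email use case, the generated text can serve as the text/plain part of a multipart/alternative message. The following is a minimal sketch using only the standard library's email package. The helper name build_email and its to_text parameter are hypothetical and not part of htmltextconvert; any callable that maps an HTML string to plain text can be passed in.

```python
from email.message import EmailMessage
from typing import Callable


def build_email(html_body: str, to_text: Callable[[str], str]) -> EmailMessage:
    """Build a message carrying both a text/plain and a text/html body.

    `to_text` is any HTML-to-text callable (hypothetical parameter name).
    """
    msg = EmailMessage()
    msg["Subject"] = "Example"
    # The plain text part is derived from the HTML rather than written by hand.
    msg.set_content(to_text(html_body))
    # Adding an alternative turns the message into multipart/alternative,
    # with the text part first and the HTML part second.
    msg.add_alternative(html_body, subtype="html")
    return msg
```

Assuming the package is installed, a call might look like build_email(html, htmltextconvert.html_to_text).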
htmltextconvert handles the following HTML tags and entities:
Character entity references (&name;, &#nnnn;, &#xhhhh;)
Unordered lists (<ul>)
Ordered lists (<ol>)
Paragraphs (<p>)
Block quotes (<blockquote>)
Linebreaks (<br>)
Links (<a href="…">)
Bold (<strong>)
Italic (<em>)
Code (<code>)
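To illustrate what rendering a few of these constructs to text can involve, here is a toy sketch built on the standard library's html.parser. It is not htmltextconvert's implementation and does not reproduce its output: the names SketchRenderer and sketch_html_to_text are made up for this example, and it only covers entity decoding, block-level breaks, and <br>. List markers, link targets, and quote prefixes are left out.

```python
import re
from html.parser import HTMLParser


class SketchRenderer(HTMLParser):
    """Toy renderer for illustration; NOT htmltextconvert's implementation."""

    BLOCK_TAGS = {"p", "blockquote", "ul", "ol", "li"}

    def __init__(self):
        # convert_charrefs=True makes the parser decode &name;, &#nnnn;
        # and &#xhhhh; references before text reaches handle_data.
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "br":
            self.parts.append("\n")
        elif tag in self.BLOCK_TAGS:
            self.parts.append("\n\n")

    def handle_endtag(self, tag):
        if tag in self.BLOCK_TAGS:
            self.parts.append("\n\n")

    def handle_data(self, data):
        # Collapse whitespace runs to a single space, roughly as a browser does.
        self.parts.append(re.sub(r"\s+", " ", data))

    def text(self):
        lines = [line.strip() for line in "".join(self.parts).split("\n")]
        # Allow at most one blank line between blocks.
        return re.sub(r"\n{3,}", "\n\n", "\n".join(lines)).strip()


def sketch_html_to_text(html):
    renderer = SketchRenderer()
    renderer.feed(html)
    renderer.close()
    return renderer.text()
```

Inline tags such as <strong>, <em>, and <code> fall through both tag handlers here, so their text is kept without any markup.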