htmltextconvert 0.1.2

Creator: rpa-with-ash

Last updated:

Add to Cart

Description:

htmltextconvert 0.1.2

htmltextconvert renders HTML to plain text, for example to autogenerate a plain
text versions of HTML emails, or to index HTML documents for search.
It differs from other packages in these ways:

Pure Python, no dependencies
High quality, well tested code
Permissive license (Apache)
Renders the HTML to text suitable for an text/plain email body (it doesn’t
aim to convert to a structured text format like markdown, but rather at giving
a readable text-only representation of the rendered HTML).

Usage:
>>> import htmltextconvert
>>> print(
... htmltextconvert.html_to_text(
... """
... <p>This is a paragraph.</p>
... <p>This is another paragraph.</p>
... """
... )
... )
This is a paragraph

This is another paragraph
htmltextconvert handles the following HTML tags:

Character entity references (&name;, &#nnnn;, &#xhhhh)
Unordered lists (<ul>)
Ordered lists (<ol>)
Paragraphs (<p>)
Block quotes (<blockquote>)
Linebreaks (<br>)
Links (<a href="…">)
Bold (<strong>)
Italic (<em>)
Code (<code>)

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

Customer Reviews

There are no reviews.