article-extraction 0.3.0

Last updated:

0 purchases

article-extraction 0.3.0 Image
article-extraction 0.3.0 Images
Add to Cart

Description:

articleextraction 0.3.0

Article extraction library.
article-extraction is a package that can be used to extract the article content
from an HTML page.
Installation
Use poetry to install the library from GitHub.
poetry add "git+https://github.com/pmatigakis/article-extraction.git"

Usage
Extract the content of an article using article-extraction.
from urllib.request import urlopen

from articles.mss.extractors import MSSArticleExtractor

document = urlopen("https://www.bbc.com/sport/formula1/64983451").read()
article_extractor = MSSArticleExtractor()
article = article_extractor.extract_article(document)
print(article)

License:

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

Customer Reviews

There are no reviews.