This blueprint extracts out title, description and body from html either via xpath or by automatic cluster analysis
Homepage Repository PyPI
pip install transmogrify.htmlcontentextractor==1.0
The Tidelift Subscription provides access to a continuously curated stream of human-researched and maintainer-verified data on open source packages and their licenses, releases, vulnerabilities, and development practices.
Login to resync this project