The 2024 Tidelift state of the open source maintainer report! 📊 Read now!

pextract
Release 0.3

Extract main textual information from HTML.

Homepage PyPI Python

License: MIT
Install: pip install pextract==0.3

Documentation

Webpage_Textual_Extraction

an uniform webpage extraction algorithm

Requirement

Python 3.5, requests, bs4

How to use

add the links you want to extract into pool.txt
set the encoding you want
run main.py

Dependencies: 3
Dependent packages: 0
Dependent repositories: 0
Total releases: 2
Latest release: Aug 6, 2018
First release: Aug 2, 2018
Stars: 0
Forks: 0
Watchers: 1
Contributors: 1
Repository size: 572 KB
SourceRank: 6

Source repo 2FA enabled: TEXT!
Package manager 2FA enabled: TEXT!
Is security responsive: TEXT!
Dependencies are managed: TEXT!
Issue-free release available: TEXT!
Succession plan available: TEXT!
Package manager 2FA enabled: TEXT!

Releases

0.3: Aug 6, 2018
0.2: Aug 2, 2018

Contributors

See all contributors

Login to resync this project