webcorpus

Generate large textual corpora for almost any language by crawling the web


Keywords
dataset, corpus
Licenses
GPL-3.0/GPL-3.0+
Install
pip install webcorpus==1.0