greek_sites_crawler

Web crawler for plenty of greek sites


License
GPL-3.0
Install
pip install greek_sites_crawler==1.0

Documentation

greek_sites_crawler

Programm which can crawl plenty of greek sites #Sites which can crawl skai,cnn,newsbomb,newsit,newsbeast,protothema,zougla ,tovima,avgi,capital,documentonews,efsyn,enikos,huffingtonpost,iefimerida,in_gr,left,liberal,naftemporiki,news247,nooz,protagon,tanea,thetoc

Run

Only you need to have is Python3 and the BeautfulSoup

Let see how you can run it

python3 greek_sites_crawler -url "site's url"

Return

That return a json in this formation {'topic':, 'title':<title>, 'article':

'publish_time':<publish_time> }