transmogrify.webcrawler
Release 1.0b5

Crawling and feeding html content into a transmogrifier pipeline

Keywords: transmogrifier, blueprint, funnelweb, source, plone, import, conversion, microsoft, office
License: GPL-2.0+
Install: pip install transmogrify.webcrawler==1.0b5

Documentation

Crawling - html to import

transmogrify.webcrawler crawls HTML to extract pages and files, which it supplies as the source for your transmogrifier pipeline. transmogrify.webcrawler.typerecognitor helps set '_type' on each item based on the crawled mimetype. transmogrify.webcrawler.cache speeds up crawling and reduces memory usage by storing items locally.

These blueprints are designed to work with the funnelweb pipeline but can be used independently.
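
As a rough illustration of the mimetype-to-'_type' idea described above, the sketch below shows a generator that annotates pipeline items. It is not the package's implementation: the '_mimetype' key, the mapping table, the type names, and the function name `recognise_types` are all assumptions made for this example.

```python
# Illustrative only: this is NOT the code of transmogrify.webcrawler.typerecognitor.
# The '_mimetype' key and the type names below are assumptions for this sketch.
MIMETYPE_TO_TYPE = {
    "text/html": "Document",
    "image/png": "Image",
    "image/jpeg": "Image",
}
DEFAULT_TYPE = "File"  # hypothetical fallback for unrecognised mimetypes


def recognise_types(items):
    """Yield each item, setting '_type' from '_mimetype' when it is present."""
    for item in items:
        mimetype = item.get("_mimetype")
        if mimetype is not None and "_type" not in item:
            # Drop parameters such as '; charset=utf-8' before the lookup.
            base = mimetype.split(";", 1)[0].strip().lower()
            item["_type"] = MIMETYPE_TO_TYPE.get(base, DEFAULT_TYPE)
        yield item


# Hypothetical crawled items, shaped like transmogrifier item dictionaries.
crawled = [
    {"_path": "index.html", "_mimetype": "text/html; charset=utf-8"},
    {"_path": "report.doc", "_mimetype": "application/msword"},
    {"_path": "notes"},
]
result = list(recognise_types(crawled))
```

Items without a mimetype pass through unchanged, so a later pipeline section could still decide how to handle them.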

Dependencies: 0
Dependent packages: 0
Dependent repositories: 1
Total releases: 13
Latest release: Jan 9, 2013
First release: Mar 22, 2010
Stars: 9
Forks: 5
Watchers: 113
Contributors: 6
Repository size: 993 KB
SourceRank: 9

Releases

1.2.1: Jan 9, 2013
1.2: Dec 28, 2012
1.1: Apr 17, 2012
1.0: Jun 29, 2011
1.0b7: Feb 17, 2011
1.0b6: Feb 12, 2011
1.0b6dev: Feb 12, 2011
1.0b5: Feb 6, 2011
1.0b4: Dec 13, 2010
1.0b3: Nov 9, 2010

Last synced: 2023-12-02 01:28:18 UTC
