beautifulsoup4-slurp

Slurp packages BeautifulSoup4 into a command-line tool.


License
MIT
Install
pip install beautifulsoup4-slurp==0.0.2

Documentation

BeautifulSoup4 packaged into a command line tool.

For now, this tool just parses HTML tag soup with BeautifulSoup4 and writes out the result. This is useful, for example, for recovering the structure of a Netscape bookmarks file, which omits many closing tags.
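To illustrate what parsing tag soup means here, the following minimal sketch calls BeautifulSoup4 directly from Python. It is not slurp's own source: the `tidy` helper and the sample bookmark fragment (including the example.com URL) are assumptions made for illustration, and it uses the standard library `html.parser` backend so that no extra parser needs to be installed.

```python
from bs4 import BeautifulSoup

# Hypothetical fragment in the style of a Netscape bookmarks file:
# the <DT> and <p> elements are never closed.
BOOKMARKS = (
    "<DL><p>"
    '<DT><A HREF="https://github.com/peterhil/slurp/">Slurp</A>'
    '<DT><A HREF="https://example.com/">Example</A>'
    "</DL>"
)


def tidy(html, parser="html.parser"):
    """Parse tag soup and return it pretty-printed (illustrative helper)."""
    soup = BeautifulSoup(html, parser)
    return soup.prettify()


result = tidy(BOOKMARKS)
print(result)
```

The printed markup contains an explicit closing tag for every element that was left open in the input. How the elements end up nested depends on the parser that is used.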

Installation

From the Python Package Index (PyPI):

(sudo) pip install beautifulsoup4-slurp

or from GitHub:

git clone https://github.com/peterhil/slurp.git
cd slurp
(sudo) python setup.py install

Usage

Show help:

slurp -h

Parse with html5lib and pretty-print to stdout:

slurp -i bookmarks.html -p 'html5lib' -y

Parse with lxml and pretty-print to stdout:

slurp -i bookmarks.html -p 'lxml' -y
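The parser names above are the ones BeautifulSoup4 itself accepts, and `lxml` and `html5lib` are separate packages that must be installed. As a rough sketch of how parser selection could be handled when calling BeautifulSoup4 directly, the hypothetical helper below tries the preferred parsers in order and falls back to the built-in `html.parser`. Whether slurp does anything similar is not stated here, so treat this as an assumption rather than a description of the tool.

```python
from bs4 import BeautifulSoup, FeatureNotFound


def parse_with_fallback(html, preferred=("lxml", "html5lib")):
    """Return (soup, parser_name), trying each preferred parser in order.

    Illustrative helper: BeautifulSoup raises FeatureNotFound when the
    requested parser library is not installed.
    """
    for name in preferred:
        try:
            return BeautifulSoup(html, name), name
        except FeatureNotFound:
            continue
    # html.parser ships with Python, so it is always available.
    return BeautifulSoup(html, "html.parser"), "html.parser"


soup, used = parse_with_fallback("<title>Slurp!</title><p>Tag soup")
print(used)
print(soup.title.string)
```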

Write pretty-printed output to a file:

slurp -y -i bookmarks.html -o bookmarks_soup.html

Pipe into slurp:

echo '<title>Slurp!</title><p><a href="https://github.com/peterhil/slurp/">Github</a>' | slurp -y
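The pipe example amounts to reading markup from one stream and writing the pretty-printed result to another. A minimal Python sketch of that idea follows; the `slurp_stream` function is a hypothetical name, not part of slurp, and in-memory streams stand in for stdin and stdout so the snippet runs on its own.

```python
import io

from bs4 import BeautifulSoup


def slurp_stream(source, sink, parser="html.parser"):
    """Read markup from `source`, write pretty-printed markup to `sink`."""
    soup = BeautifulSoup(source.read(), parser)
    sink.write(soup.prettify())


# In a real filter, sys.stdin and sys.stdout could be passed instead.
source = io.StringIO(
    '<title>Slurp!</title><p><a href="https://github.com/peterhil/slurp/">Github</a>'
)
sink = io.StringIO()
slurp_stream(source, sink)

output = sink.getvalue()
print(output)
```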

License

Copyright (c) 2015, Peter Hillerström <peter.hillerstrom@gmail.com>
All rights reserved.

This software is licensed under the MIT license.

For the full copyright and license information, please view the LICENSE file that was distributed with this source code.