API to extract content from HTML & XML documents


Keywords
lxml, dom, html, query-api, xml, xpath, xpath-api
License
Other
Install
pip install selection==0.0.21

Documentation

Selection

Travis CI coveralls.io

API to query DOM tree of HTML/XML document.

Usage Example

Example:

from selection import XpathSelector
from lxml.html import fromstring

html = '<div><h1>test</h1><ul id="items"><li>1</li><li>2</li></ul></div>'
sel = XpathSelector(fromstring(html))
print(sel.select('//h1')).text()
print(sel.select('//li').text_list()
print(sel.select('//ul').attr('id')

Installation

Run:

pip install -U pip setuptools
pip install -U selection

Dependencies

  • lxml
  • tools
  • six