saxtract

Python SAX parser to extract xml


Keywords
saxtract
License
MIT
Install
pip install saxtract==0.1.0

Documentation

SaxTract

Documentation Status Updates

Python SAX parser to extract xml

Features

Uses a SAXParser to maintain a fix memory footprint to parse and 'extract' tags from an xml file and push it to an output stream.

With performance tests on a trimmed down to 10k records from the dbpl dataset, SaxTrack ran in about half the time and half the memory footprint

  • TODO - allow xsd/dtd input for validation

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

The main parser code was copied from tutorialspoint