SaxTract
Python SAX parser to extract xml
- Free software: MIT license
- Documentation: https://saxtract.readthedocs.io
Features
Uses a SAXParser to maintain a fix memory footprint to parse and 'extract' tags from an xml file and push it to an output stream.
With performance tests on a trimmed down to 10k records from the dbpl dataset, SaxTrack ran in about half the time and half the memory footprint
- TODO - allow xsd/dtd input for validation
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
The main parser code was copied from tutorialspoint