rusthtml

A html parser written in rust


Keywords
parser, html
License
GPL-3.0-only

Documentation

RustHtml - a small html parser

About The Project

RustHtml is a small html parser written in rust.

Warning: this library is not production-ready. Many of the tags and standard are not implemented (mainly located in tag_optimize()). PRs are welcomed.

Completed & planned features:

  • Parse simple html
  • Parse html with void elements
  • Parse html with javascript
  • Parse html with complicated elements (such as html without head ending tag, etc)

Benchmarking

Run cargo bench to benchmark the program.

On my local device, parsing a 43833 lines html requires 18.697 ms to complete

License

Distributed under the GPL-3.0-Only License. See LICENSE for more information.