diff options
author | Martin Fischer <martin@push-f.com> | 2021-04-08 15:27:23 +0200 |
---|---|---|
committer | Martin Fischer <martin@push-f.com> | 2021-04-08 15:57:18 +0200 |
commit | 18a022579737acc599277258fc4a8472ba73a8b0 (patch) | |
tree | 908aa0f40c2be0d8a1ed305d021a256383221f8b /README.md | |
parent | 7c1b82bf4bac750b7d7311ee29625e40e4441aef (diff) |
rebrand as html5tokenizer
Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 26 |
1 files changed, 26 insertions, 0 deletions
diff --git a/README.md b/README.md new file mode 100644 index 0000000..531358e --- /dev/null +++ b/README.md @@ -0,0 +1,26 @@ +# html5tokenizer + +This crate provides the tokenizer form [html5ever](https://crates.io/crates/html5ever), +repackaged with all of its dependencies removed. The following dependencies were removed: + +* [markup5ever](https://crates.io/crates/markup5ever) + `buffer_queue` and `smallcharset` were merged into the source code + +* [tendril](https://crates.io/crates/tendril) + According to its README it contains "a substantial amount of unsafe code". + This fork replaces the tendril strings with plain old `std::string::String`s. + +* [mac](https://crates.io/crates/mac) + The only macros actually needed (`format_if` and `test_eq`) were merged into + the source code. + +* [log](https://crates.io/crates/log) + Was only used for debug output. + +If you want to parse HTML into a tree (DOM) you should by all means use +html5ever, this crate is merely for those who only want an HTML5 tokenizer and +seek to minimize their compile dependencies (html5ever pulls in 56). + +## Credits + +Thanks to the developers of html5ever for their awesome parser! |