author Martin Fischer <martin@push-f.com> 2021-04-08 15:27:23 +0200
committer Martin Fischer <martin@push-f.com> 2021-04-08 15:57:18 +0200
commit 18a022579737acc599277258fc4a8472ba73a8b0
tree 908aa0f40c2be0d8a1ed305d021a256383221f8b /README.md
parent 7c1b82bf4bac750b7d7311ee29625e40e4441aef
rebrand as html5tokenizer
Diffstat (limited to 'README.md')
-rw-r--r-- README.md 26
1 files changed, 26 insertions, 0 deletions
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..531358e
--- /dev/null
+++ b/README.md
@@ -0,0 +1,26 @@
+# html5tokenizer
+
+This crate provides the tokenizer from [html5ever](https://crates.io/crates/html5ever),
+repackaged so that it has no dependencies. The following dependencies were removed:
+
+* [markup5ever](https://crates.io/crates/markup5ever)
+ `buffer_queue` and `smallcharset` were merged into the source code.
+
+* [tendril](https://crates.io/crates/tendril)
+ According to its README it contains "a substantial amount of unsafe code".
+ This fork replaces the tendril strings with plain old `std::string::String`s.
+
+* [mac](https://crates.io/crates/mac)
+ The only macros actually needed (`format_if` and `test_eq`) were merged into
+ the source code.
+
+* [log](https://crates.io/crates/log)
+ Was only used for debug output.
+
+If you want to parse HTML into a tree (DOM), you should by all means use
+html5ever. This crate is merely for those who only want an HTML5 tokenizer and
+seek to minimize their compile-time dependencies (html5ever pulls in 56).
+
+## Credits
+
+Thanks to the developers of html5ever for their awesome parser!