summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorMartin Fischer <martin@push-f.com>2023-09-03 12:48:33 +0200
committerMartin Fischer <martin@push-f.com>2023-09-03 23:00:04 +0200
commit427a31f8a076d0fea156f45b160c08e3c0a53bef (patch)
tree0d4ff6898aee0ba05e0eee09b00763be17b7f9e4
parenta2be091994247181086eb34dcda0857bd5435fe4 (diff)
docs: add 'Compliance & testing' section to readme
-rw-r--r--README.md9
1 files changed, 6 insertions, 3 deletions
diff --git a/README.md b/README.md
index 61801d1..b60bdfa 100644
--- a/README.md
+++ b/README.md
@@ -43,6 +43,11 @@ assert_eq!(new_html, "<title>hello world</title>");
* This crate does not yet implement spans for character tokens.
+## Compliance & testing
+
+The tokenizer passes the [html5lib tokenizer test suite].
+The library is not yet fuzz tested.
+
## Compared to html5gum
`html5tokenizer` was forked from [html5gum] 0.2.1.
@@ -52,8 +57,6 @@ assert_eq!(new_html, "<title>hello world</title>");
For details please refer to the [changelog].
-Both crates pass the [html5lib tokenizer test suite].
-
Both crates have an `Emitter` trait that lets you bring your own token data
structure and hook into token creation by implementing the `Emitter` trait.
This allows you to:
@@ -72,7 +75,7 @@ Licensed under the MIT license, see [the LICENSE file].
[parsing model]: https://html.spec.whatwg.org/multipage/parsing.html#overview-of-the-parsing-model
[character encoding detection]: https://html.spec.whatwg.org/multipage/parsing.html#determining-the-character-encoding
-[html5gum]: https://crates.io/crates/html5gum
[html5lib tokenizer test suite]: https://github.com/html5lib/html5lib-tests/tree/master/tokenizer
+[html5gum]: https://crates.io/crates/html5gum
[changelog]: ./CHANGELOG.md
[the LICENSE file]: ./LICENSE