aboutsummaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
authorMarkus Unterwaditzer <markus-honeypot@unterwaditzer.net>2021-11-24 21:12:29 +0100
committerMarkus Unterwaditzer <markus-honeypot@unterwaditzer.net>2021-11-24 21:12:29 +0100
commit2988a46b6b3e8b82fcd8cc5211a74fc4bb7538b9 (patch)
treea0c67c142cd48fc2509b7efc6f89cd572c62b2ed /README.md
parent3da2052fa180ce5cac0f128382481b630164b9b4 (diff)
link issue
Diffstat (limited to 'README.md')
-rw-r--r--README.md10
1 files changed, 7 insertions, 3 deletions
diff --git a/README.md b/README.md
index d7cbf50..1763706 100644
--- a/README.md
+++ b/README.md
@@ -33,9 +33,13 @@ assert_eq!(new_html, "<title>hello world</title>");
It fully implements [13.2 of the WHATWG HTML
spec](https://html.spec.whatwg.org/#parsing) and passes [html5lib's tokenizer
test suite](https://github.com/html5lib/html5lib-tests/tree/master/tokenizer),
-except that this implementation requires all input to be Rust strings and
-therefore valid UTF-8. There is no charset detection or handling of invalid
-surrogates, and the relevant html5lib tests are skipped in CI.
+except that:
+
+* this implementation requires all input to be Rust strings and therefore valid
+ UTF-8. There is no charset detection or handling of invalid surrogates, and
+ the relevant html5lib tests are skipped in CI.
+
+* there's some remaining testcases to be decided on at [issue 5](https://github.com/untitaker/html5gum/issues/5).
A distinguishing feature of `html5gum` is that you can bring your own token
datastructure and hook into token creation by implementing the `Emitter` trait.