diff options
author | Markus Unterwaditzer <markus-honeypot@unterwaditzer.net> | 2021-11-28 00:09:29 +0100 |
---|---|---|
committer | Markus Unterwaditzer <markus-honeypot@unterwaditzer.net> | 2021-11-28 00:09:29 +0100 |
commit | 664652212c3efc05bf70c39ee00c718d51f33bac (patch) | |
tree | 40a6eab2b697d44526fe70f3bd5630ac6a4c8570 | |
parent | 5fac4df0b90b9634b4678bdf41053fee525ebbc0 (diff) |
update wording again
-rw-r--r-- | README.md | 2 |
1 files changed, 1 insertions, 1 deletions
@@ -32,7 +32,7 @@ assert_eq!(new_html, "<title>hello world</title>"); It fully implements [13.2.5 of the WHATWG HTML spec](https://html.spec.whatwg.org/#tokenization), i.e. is able to tokenize HTML documents and passes [html5lib's tokenizer -test suite](https://github.com/html5lib/html5lib-tests/tree/master/tokenizer), with the following exceptions: +test suite](https://github.com/html5lib/html5lib-tests/tree/master/tokenizer). Since it is just a tokenizer, this means: * html5gum **does not** [implement charset detection.](https://html.spec.whatwg.org/#determining-the-character-encoding) This implementation requires all input to be Rust strings and therefore valid UTF-8. |