From 664652212c3efc05bf70c39ee00c718d51f33bac Mon Sep 17 00:00:00 2001 From: Markus Unterwaditzer Date: Sun, 28 Nov 2021 00:09:29 +0100 Subject: update wording again --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 4f9d066..0894c9b 100644 --- a/README.md +++ b/README.md @@ -32,7 +32,7 @@ assert_eq!(new_html, "hello world"); It fully implements [13.2.5 of the WHATWG HTML spec](https://html.spec.whatwg.org/#tokenization), i.e. is able to tokenize HTML documents and passes [html5lib's tokenizer -test suite](https://github.com/html5lib/html5lib-tests/tree/master/tokenizer), with the following exceptions: +test suite](https://github.com/html5lib/html5lib-tests/tree/master/tokenizer). Since it is just a tokenizer, this means: * html5gum **does not** [implement charset detection.](https://html.spec.whatwg.org/#determining-the-character-encoding) This implementation requires all input to be Rust strings and therefore valid UTF-8. -- cgit v1.2.3