fix(tokenizer): don't lowercase temp chars in ScriptDataEndTagName

This bug resulted in e.g. "<script></SCRI" being wrongly tokenized as: StartTag(StartTag { name: "script", self_closing: false, attributes: {} }) Char('<') Char('/') Char('s') Char('c') Char('r') Char('i') EndOfFile Note that the Char tokens should be uppercase. (This bug could only be observed when properly doing state switching via tree construction.)
author: Martin Fischer <martin@push-f.com> 2023-09-26 08:22:21 +0200
committer: Martin Fischer <martin@push-f.com> 2023-09-28 10:57:11 +0200
commit: ee8ab781672e7ab608e74a5b605eb189828f0afe (patch)
tree: a5fcb14faf4dc2c29a4ed9f6a0c087acf8bcfb0d /src/tokenizer
parent: ade1034c46f3c5e50bdf07fd1556b7957011fe98 (diff)
1 files changed, 1 insertions, 1 deletions
diff --git a/src/tokenizer/machine.rs b/src/tokenizer/machine.rs
index 944eb01..100f645 100644
--- a/src/tokenizer/machine.rs
+++ b/src/tokenizer/machine.rs
@@ -428,7 +428,7 @@ where
             }
             Some(x) if x.is_ascii_alphabetic() => {
                 slf.push_tag_name(ctostr!(x.to_ascii_lowercase()));
-                slf.temporary_buffer.push(x.to_ascii_lowercase());
+                slf.temporary_buffer.push(x);
                 Ok(ControlToken::Continue)
             }
             c => {
author	Martin Fischer <martin@push-f.com>	2023-09-26 08:22:21 +0200
committer	Martin Fischer <martin@push-f.com>	2023-09-28 10:57:11 +0200
commit	ee8ab781672e7ab608e74a5b605eb189828f0afe (patch)
tree	a5fcb14faf4dc2c29a4ed9f6a0c087acf8bcfb0d /src/tokenizer
parent	ade1034c46f3c5e50bdf07fd1556b7957011fe98 (diff)