diff options
author | Martin Fischer <martin@push-f.com> | 2023-09-26 08:22:21 +0200 |
---|---|---|
committer | Martin Fischer <martin@push-f.com> | 2023-09-28 10:57:11 +0200 |
commit | ee8ab781672e7ab608e74a5b605eb189828f0afe (patch) | |
tree | a5fcb14faf4dc2c29a4ed9f6a0c087acf8bcfb0d /src/tokenizer | |
parent | ade1034c46f3c5e50bdf07fd1556b7957011fe98 (diff) |
fix(tokenizer): don't lowercase temp chars in ScriptDataEndTagName
This bug resulted in e.g. "<script></SCRI" being wrongly tokenized as:
StartTag(StartTag { name: "script", self_closing: false, attributes: {} })
Char('<')
Char('/')
Char('s')
Char('c')
Char('r')
Char('i')
EndOfFile
Note that the Char tokens should be uppercase. (This bug could only be
observed when properly doing state switching via tree construction.)
Diffstat (limited to 'src/tokenizer')
-rw-r--r-- | src/tokenizer/machine.rs | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/src/tokenizer/machine.rs b/src/tokenizer/machine.rs index 944eb01..100f645 100644 --- a/src/tokenizer/machine.rs +++ b/src/tokenizer/machine.rs @@ -428,7 +428,7 @@ where } Some(x) if x.is_ascii_alphabetic() => { slf.push_tag_name(ctostr!(x.to_ascii_lowercase())); - slf.temporary_buffer.push(x.to_ascii_lowercase()); + slf.temporary_buffer.push(x); Ok(ControlToken::Continue) } c => { |