diff options
| author | Martin Fischer <martin@push-f.com> | 2023-09-26 08:22:21 +0200 | 
|---|---|---|
| committer | Martin Fischer <martin@push-f.com> | 2023-09-28 10:57:11 +0200 | 
| commit | ee8ab781672e7ab608e74a5b605eb189828f0afe (patch) | |
| tree | a5fcb14faf4dc2c29a4ed9f6a0c087acf8bcfb0d /src/tokenizer | |
| parent | ade1034c46f3c5e50bdf07fd1556b7957011fe98 (diff) | |
fix(tokenizer): don't lowercase temp chars in ScriptDataEndTagName
This bug resulted in e.g. "<script></SCRI" being wrongly tokenized as:
    StartTag(StartTag { name: "script", self_closing: false, attributes: {} })
    Char('<')
    Char('/')
    Char('s')
    Char('c')
    Char('r')
    Char('i')
    EndOfFile
Note that the Char tokens should be uppercase. (This bug could only be
observed when properly doing state switching via tree construction.)
Diffstat (limited to 'src/tokenizer')
| -rw-r--r-- | src/tokenizer/machine.rs | 2 | 
1 files changed, 1 insertions, 1 deletions
| diff --git a/src/tokenizer/machine.rs b/src/tokenizer/machine.rs index 944eb01..100f645 100644 --- a/src/tokenizer/machine.rs +++ b/src/tokenizer/machine.rs @@ -428,7 +428,7 @@ where              }              Some(x) if x.is_ascii_alphabetic() => {                  slf.push_tag_name(ctostr!(x.to_ascii_lowercase())); -                slf.temporary_buffer.push(x.to_ascii_lowercase()); +                slf.temporary_buffer.push(x);                  Ok(ControlToken::Continue)              }              c => { | 
