Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-08-19 | break!: remove StartTag::next_state | Martin Fischer | |
You shouldn't manually have to match tokens yielded by the tokenizer iterator just to correctly handle state transitions. A better NaiveParser API will be introduced. | |||
2023-08-19 | break!: remove set_last_start_tag from Emitter | Martin Fischer | |
2023-08-19 | refactor: move html5lib test to own crate to fix `cargo test` | Martin Fischer | |
Previously `cargo test` failed because it ran the test_html5lib integration test, which depends on the integration-tests feature (so you always had to run `cargo test` with `--features integration-tests` or `--all-features`, which was annoying). This commit moves the integration tests to another crate, so that the dependency on the feature can be properly defined in a way so that `cargo test` just works and runs the test. | |||
2023-08-19 | chore: drop test-generator dev-dependency | Martin Fischer | |
I want to move the test_html5lib integration test to a separate crate so that it can properly depend on the integration-tests feature in a way so that `cargo test` just works and runs the integration test. (Currently `cargo test` fails since test_html5lib depends on that feature.) However test_html5lib currently depends on the test-generator crate and test-generator doesn't support Cargo workspaces[1] and appears to be unmaintained. This commit therefore drops the test-generator dev-dependency. [1]: https://github.com/frehberg/test-generator/issues/6 | |||
2021-12-05 | rename to html5tokenizer, bump versionv0.4.0 | Martin Fischer | |
2021-12-05 | spans: get rid of code duplication by introducing Span trait | Martin Fischer | |
2021-12-05 | spans: refactor to avoid one clone() | Martin Fischer | |
2021-12-05 | rename internal emit_error to push_error (to avoid confusion with trait method) | Martin Fischer | |
2021-12-05 | improve duplicate attribute span | Martin Fischer | |
2021-12-05 | refactor: match btree_map::Entry instead of using and_modify closure | Martin Fischer | |
2021-12-05 | spans: slightly refactor DefaultEmitter | Martin Fischer | |
2021-12-05 | spans: add spans to Token::Error | Martin Fischer | |
2021-12-05 | spans: fix spans for quoted attribute values | Martin Fischer | |
2021-12-05 | spans: support attribute values | Martin Fischer | |
2021-12-05 | spans: support attribute names | Martin Fischer | |
2021-12-05 | spans: add span tests | Martin Fischer | |
2021-12-05 | spans: start implementing SpanEmitter | Martin Fischer | |
2021-12-05 | spans: introduce PosTracker | Martin Fischer | |
2021-12-05 | spans: introduce GetPos trait | Martin Fischer | |
2021-12-05 | spans: rename to SpanEmitter, adjust generics | Martin Fischer | |
2021-12-05 | spans: copy DefaultEmitter to new span module | Martin Fischer | |
2021-12-05 | spans: make Emitter generic over Reader | Martin Fischer | |
2021-12-05 | spans: make Emitter generic over Span | Martin Fischer | |
2021-12-05 | fix wrong state transition in ScriptDataLessThanSign state | Martin Fischer | |
Before the following happened: % printf '<script><b>test</b></script>' | cargo run --example=switch-state StartTag(StartTag { self_closing: false, name: "script", attributes: {} }) String("<b>test") EndTag(EndTag { name: "b" }) EndTag(EndTag { name: "script" }) Which is obviously wrong. After a <script> tag we want to switch to the ScriptData state (instead of the Data state). This commit fixes this implementation error, making the above command produce the expected output of: StartTag(StartTag { self_closing: false, name: "script", attributes: {} }) String("<b>test</b>") EndTag(EndTag { name: "script" }) | |||
2021-12-05 | introduce StartTag::next_state | Martin Fischer | |
Closes #11. | |||
2021-12-05 | allow setting the Tokenizer to Data, PlainText, RcData, RawText and ↵ | Martin Fischer | |
ScriptData states | |||
2021-12-05 | prepare for introduction of public State enum | Martin Fischer | |
2021-12-03 | fix new clippy | Markus Unterwaditzer | |
2021-12-03 | Fix typo and add example (#9) | Martin Fischer | |
2021-11-28 | version 0.2.1html5gum-0.2.1 | Markus Unterwaditzer | |
2021-11-28 | update wording | Markus Unterwaditzer | |
2021-11-28 | restructure readme | Markus Unterwaditzer | |
2021-11-28 | add another example | Markus Unterwaditzer | |
2021-11-28 | update wording again | Markus Unterwaditzer | |
2021-11-28 | fix period | Markus Unterwaditzer | |
2021-11-28 | update wording again | Markus Unterwaditzer | |
2021-11-28 | fix wording | Markus Unterwaditzer | |
2021-11-28 | clarify what html5gum isn't, fix #5 | Markus Unterwaditzer | |
2021-11-27 | fix crash in try_read_string | Markus Unterwaditzer | |
2021-11-27 | split up match-arms and tokenizer to isolate some tokenizer-internal state | Markus Unterwaditzer | |
purpose: don't want to expose self.to_reconsume to the consume() method | |||
2021-11-26 | version 0.2.0html5gum-0.2.0 | Markus Unterwaditzer | |
2021-11-26 | Read html from io::BufRead (#8) | Markus Unterwaditzer | |
2021-11-26 | update ci to take care of broken doc refs | Markus Unterwaditzer | |
2021-11-26 | clean up reader interface | Markus Unterwaditzer | |
2021-11-26 | run cargo-diet | Markus Unterwaditzer | |
2021-11-26 | check out submodules | Markus Unterwaditzer | |
2021-11-26 | re-add html5lib-tests | Markus Unterwaditzer | |
2021-11-26 | remove html5lib-tests | Markus Unterwaditzer | |
2021-11-26 | Update dependabot.yml | Markus Unterwaditzer | |
2021-11-26 | Create dependabot.yml | Markus Unterwaditzer | |