From 57e7eefcbe6fb8c3dc4b01c707be9de4c34963a7 Mon Sep 17 00:00:00 2001 From: Martin Fischer Date: Thu, 8 Apr 2021 08:42:01 +0200 Subject: import https://github.com/servo/html5ever commit d1206daa740305f55a5fa159e43eb33afc359cb4 --- data/bench/small-fragment.html | 7 +++++++ 1 file changed, 7 insertions(+) create mode 100644 data/bench/small-fragment.html (limited to 'data/bench/small-fragment.html') diff --git a/data/bench/small-fragment.html b/data/bench/small-fragment.html new file mode 100644 index 0000000..a0b9643 --- /dev/null +++ b/data/bench/small-fragment.html @@ -0,0 +1,7 @@ +

In July 1992, the X/Open committee XoJIG was looking for a better encoding. Dave Prosser of Unix System Laboratories + submitted a proposal for one that had faster implementation +characteristics and introduced the improvement that 7-bit ASCII +characters would only represent themselves; all multibyte +sequences would include only bytes where the high bit was set. This +original proposal, FSS-UTF (File System Safe UCS Transformation Format), + was similar in concept to UTF-8, but lacked the crucial property of self-synchronization. -- cgit v1.2.3