aboutsummaryrefslogtreecommitdiff
path: root/data/bench/small-fragment.html
diff options
context:
space:
mode:
Diffstat (limited to 'data/bench/small-fragment.html')
-rw-r--r--data/bench/small-fragment.html7
1 files changed, 7 insertions, 0 deletions
diff --git a/data/bench/small-fragment.html b/data/bench/small-fragment.html
new file mode 100644
index 0000000..a0b9643
--- /dev/null
+++ b/data/bench/small-fragment.html
@@ -0,0 +1,7 @@
+<p>In July 1992, the <a href="http://en.wikipedia.org/wiki/X/Open" title="X/Open">X/Open</a> committee XoJIG was looking for a better encoding. Dave Prosser of <a href="http://en.wikipedia.org/wiki/Unix_System_Laboratories" title="Unix System Laboratories">Unix System Laboratories</a>
+ submitted a proposal for one that had faster implementation
+characteristics and introduced the improvement that 7-bit ASCII
+characters would <i>only</i> represent themselves; all multibyte
+sequences would include only bytes where the high bit was set. This
+original proposal, FSS-UTF (File System Safe UCS Transformation Format),
+ was similar in concept to UTF-8, but lacked the crucial property of <a href="http://en.wikipedia.org/wiki/Self-synchronizing_code" title="Self-synchronizing code">self-synchronization</a>.