diff options
Diffstat (limited to 'data/bench/small-fragment.html')
-rw-r--r-- | data/bench/small-fragment.html | 7 |
1 files changed, 7 insertions, 0 deletions
diff --git a/data/bench/small-fragment.html b/data/bench/small-fragment.html new file mode 100644 index 0000000..a0b9643 --- /dev/null +++ b/data/bench/small-fragment.html @@ -0,0 +1,7 @@ +<p>In July 1992, the <a href="http://en.wikipedia.org/wiki/X/Open" title="X/Open">X/Open</a> committee XoJIG was looking for a better encoding. Dave Prosser of <a href="http://en.wikipedia.org/wiki/Unix_System_Laboratories" title="Unix System Laboratories">Unix System Laboratories</a> + submitted a proposal for one that had faster implementation +characteristics and introduced the improvement that 7-bit ASCII +characters would <i>only</i> represent themselves; all multibyte +sequences would include only bytes where the high bit was set. This +original proposal, FSS-UTF (File System Safe UCS Transformation Format), + was similar in concept to UTF-8, but lacked the crucial property of <a href="http://en.wikipedia.org/wiki/Self-synchronizing_code" title="Self-synchronizing code">self-synchronization</a>. |