{{Table Unicode}}
The '''Standard Compression Scheme for Unicode''' (SCSU) is a [[Unicode]] Technical Standard<ref>{{cite web |url=http://www.unicode.org/reports/tr6/
[[Reuters]], the organization that floated the first draft of SCSU, is believed to use SCSU internally.
SCSU has not been a resounding success. Few applications need to compress so much Unicode text that a special-purpose compression scheme without widespread support is worth using. Also, while SCSU can be used as a text encoding, it can be difficult to handle internally.
Treated purely as a compression algorithm, SCSU is inferior to most commonly used general-purpose algorithms for texts longer than a few kilobytes. One of several problems with SCSU is that its size savings over [[UTF-16]] or [[UTF-8]] shrink once the text is also passed through an external compressor<ref>
|date=2004-01-30 |first=Doug |last=Ewell |accessdate=2008-06-13 |format=PDF }}</ref>, often dramatically so.
SCSU does have the advantage that it can usefully compress texts that are only a few characters long, whereas most full-scale compressors need a few kilobytes of data to break even against their own overhead.
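
The break-even effect for general-purpose compressors can be illustrated with a minimal Python sketch. It uses the standard <code>zlib</code> module as a stand-in for a "full-scale compressor"; the helper name <code>sizes</code> and the sample strings are hypothetical, chosen only for illustration, and the sketch does not implement or measure SCSU itself.

```python
import zlib


def sizes(text: str) -> tuple[int, int]:
    """Return (UTF-8 size, zlib-compressed size) in bytes for text."""
    raw = text.encode("utf-8")
    # Level 9 asks zlib for its strongest compression setting.
    return len(raw), len(zlib.compress(raw, 9))


# Hypothetical samples: a six-letter word and a highly repetitive longer text.
short_text = "Привет"
long_text = "Привет, мир! " * 500

for label, sample in (("short", short_text), ("long", long_text)):
    raw_len, packed_len = sizes(sample)
    print(f"{label}: {raw_len} bytes raw, {packed_len} bytes after zlib")
```

For the short sample, the zlib output is larger than the input because the fixed header and checksum outweigh any savings; only the longer sample comes out smaller.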
* [[International Components for Unicode]], a library that can convert between SCSU and other Unicode encodings
==
<references />
[[Category:Unicode]]