Standard Compression Scheme for Unicode: Difference between revisions

KolbertBot (talk | contribs)
m Bot: HTTP→HTTPS (v485)
This seems like no compression at all, if 50% is the best possible. "Unicode values" is misleading here, meaning UCS-2 (precursor to UTF-16) for SQL Server. Just going with the source. Newer versions probably support full UTF-16; not sure whether they use this compression.
Line 6:
[[Reuters]], the organization that floated the first draft of SCSU, is believed to use SCSU internally.
 
[[Microsoft SQL Server|SQL Server 2008 R2]] uses SCSU to compress Unicode values (there meaning [[UCS-2]]) stored in ''nchar(n)'' and ''nvarchar(n)'' columns, achieving space savings of between 15% and 50% depending on the language of the data (by comparison, [[UTF-8]] already achieves a 50% reduction for the [[ASCII]] subset of Unicode).<ref>{{cite web|url=http://msdn.microsoft.com/en-us/library/ee240835(SQL.105).aspx|title=Unicode Compression Implementation (SQL Server 2008 R2 Books Online)|accessdate=2008-08-18}}</ref>
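
The 50% figure for ASCII can be checked with simple byte counts. The following is a minimal sketch, not an SCSU implementation: it only compares a two-byte-per-character encoding with UTF-8. It assumes Python's <code>utf-16-le</code> codec as a stand-in for UCS-2 (the two coincide for characters in the Basic Multilingual Plane), and the helper names <code>encoded_sizes</code> and <code>saving</code> are hypothetical.

```python
def encoded_sizes(text):
    """Return (ucs2_bytes, utf8_bytes) for the given text.

    Assumption: 'utf-16-le' stands in for UCS-2, which holds only while
    every character is in the Basic Multilingual Plane.
    """
    ucs2_bytes = len(text.encode("utf-16-le"))  # 2 bytes per BMP character, no BOM
    utf8_bytes = len(text.encode("utf-8"))      # 1 byte per ASCII character
    return ucs2_bytes, utf8_bytes


def saving(text):
    """Fraction of space saved by UTF-8 relative to the UCS-2 stand-in."""
    ucs2_bytes, utf8_bytes = encoded_sizes(text)
    return 1 - utf8_bytes / ucs2_bytes


ascii_sample = "Standard Compression Scheme for Unicode"  # pure ASCII
cyrillic_sample = "Юникод"                                # 2 bytes each in UTF-8

print(encoded_sizes(ascii_sample), saving(ascii_sample))
print(encoded_sizes(cyrillic_sample), saving(cyrillic_sample))
```

For pure ASCII text the saving is exactly one half, while for the Cyrillic sample UTF-8 saves nothing over two-byte storage. Non-Latin scripts of this kind are where a windowed scheme such as SCSU is intended to help.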
 
== Comparison with general-purpose plain text compression schemes ==