UTF-C Demo

UTF-C offers a compact way to store Unicode strings. Using the text field below you can test it on any text and compare its effectiveness to UTF-8 and SCSU. You can read more about the algorithm on Github or inspect the source code directly.

Try a sample text:EnglishHebrewGreekThaiFrenchUkrainianVietnameseJapaneseRussianKazakhTurkishCzechChineseKoreanGeorgianArabicPersianZalgoEmoji
95 code points
UTF-8 (157 bytes): 5554462D4320737570706F7274732061show more »
SCSU (149 bytes): 5554462D4320737570706F7274732061show more »
UTF-C (117 bytes): 5554462D4320737570706F7274732061show more »
25.5% better than UTF-8
20.4% better than SCSU
Decoded back as: UTF-C supports all Unicode characters: русский, العربية, 한국어, עברית, ქართული, Ελληνικά, 日本語 🔥🎉👍 (95 code points, matches input)