Short Story

Dasar teknologi mendeteksi kombinasi dari 96 pasangan bahasa pengkodean yang melibatkan 40 bahasa yang berbeda dan 30 jenis pengkodean yang unik:



Language
Encoding
Albanian UTF-8, Windows-1252
Arabic UTF-8, Windows-1256, ISO-8859-6
Bahasa Indonesia UTF-8, Windows-1252
Bahasa Malay UTF-8, Windows-1252
Bulgarian UTF-8, Windows-1251, ISO-8859-5, KOI8-R
Catalan UTF-8, Windows-1252
Chinese UTF-8, GB-2312, HZ-GB-2312, ISO-2022-CN
Chinese UTF-8, Big5
Croatian UTF-8, Windows-1250
Czech UTF-8, Windows-1250
Danish UTF-8, Windows-1252
Dutch UTF-8, Windows-1252
English UTF-8, Windows-1252
Estonian UTF-8, Windows-1257
Farsi UTF-8, Windows-1256
Finnish UTF-8, Windows-1252
French UTF-8, Windows-1252
German UTF-8, Windows-1252
Greek UTF-8, Windows-1253
Hebrew UTF-8, Windows-1255
Hungarian UTF-8, Windows-1250
Icelandic UTF-8, Windows-1252
Italian UTF-8, Windows-1252
Japanese UTF-8, EUC-JP, ISO-2022-JP, Shift-JIS
Korean UTF-8, EUC-KR, ISO-2022-KR
Latvian UTF-8, Windows-1257
Lithuanian UTF-8, Windows-1257
Norwegian UTF-8, Windows-1252
Polish UTF-8, Windows-1250
Portuguese UTF-8, Windows-1252
Romanian UTF-8, Windows-1250
Russian UTF-8, Windows-1251, ISO-8859-5, IBM-866, KOI8-R, x-Mac-Cyrillic
Slovak UTF-8, Windows-1250
Slovenian UTF-8, Windows-1250
Spanish UTF-8, Windows-1252
Swedish UTF-8, Windows-1252
Tagalog UTF-8, Windows-1252
Thai UTF-8, Windows-874
Turkish UTF-8, Windows-1254
Vietnamese UTF-8, VISCII, VPS, VIQR, TCVN, VNI

Sumber: http://www.mkbergman.com/195/tutorial-internet-languages-character-sets-and-encodings/