Expected names of Microsoft Windows "ANSI" Code Pages (Encodings)

I was asked about our use of the Windows "ANSI" code page names, as used in things like MIME types, HTTP content-type tags, etc.  Each code page has a name that most accurately round-trips back to the same code page, which I've listed as the "preferred name" below.  Additionally, when you ask for a code page matching a name, some code pages have several aliases that map to identical behavior (listed as "aliases" in the table below).

Note that there are quite a few inconsistencies and other odd behaviors.  Some code pages have names of the form windows-xxx, and others don't even recognize that form as an alias.  Additionally, some reference material I've seen refers to any Microsoft code page in the windows-xxxx or CPxxx form, whether or not Windows itself recognizes those names.

We have no intention of trying to create a more consistent naming scheme; we prefer that applications use Unicode.

So in practice, you should use the "preferred name" when tagging data in a particular code page, but if you are accepting input data, recognize that it may be tagged with one of the listed aliases instead.  Encoding.GetEncoding() should "do the right thing".  You might want to look at my previous posts "Encoding.GetEncodings() has a couple "duplicate" names" and "What's my Encoding Called?".
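For code that cannot call Encoding.GetEncoding(), the "accept any alias, emit the preferred name" advice can be sketched as a small lookup table. The following Python is only an illustration built from the names listed in this post: the `CODE_PAGES` table and the `code_page_for` and `preferred_name` helpers are hypothetical names, the case-insensitive matching is an assumption of the sketch, and none of it reflects how Windows or .NET resolves names internally.

```python
# Hypothetical table: code page -> (preferred name, aliases), copied from this post.
CODE_PAGES = {
    874: ("windows-874", ["DOS-874", "iso-8859-11", "TIS-620"]),
    932: ("shift_jis", ["csShiftJIS", "csWindows31J", "ms_Kanji", "shift-jis",
                        "sjis", "x-ms-cp932", "x-sjis"]),
    936: ("gb2312", ["chinese", "CN-GB", "csGB2312", "csGB231280",
                     "csISO58GB231280", "GB2312-80", "GB231280", "GBK",
                     "GB_2312-80", "iso-ir-58"]),
    949: ("ks_c_5601-1987", ["csKSC56011987", "iso-ir-149", "korean",
                             "ks-c-5601", "ks-c5601", "KSC5601", "KSC_5601",
                             "ks_c_5601", "ks_c_5601-1989", "ks_c_5601_1987"]),
    950: ("big5", ["Big5-HKSCS", "cn-big5", "csbig5", "x-x-big5"]),
    1250: ("windows-1250", ["x-cp1250"]),
    1251: ("windows-1251", ["x-cp1251"]),
    1252: ("Windows-1252", ["x-ansi"]),
    1253: ("windows-1253", []),
    1254: ("windows-1254", []),
    1255: ("windows-1255", []),
    1256: ("windows-1256", ["cp1256"]),
    1257: ("windows-1257", []),
    1258: ("windows-1258", []),
}


def _build_index(table):
    """Map every lower-cased preferred name and alias to its code page number."""
    index = {}
    for code_page, (preferred, aliases) in table.items():
        for name in [preferred, *aliases]:
            index[name.lower()] = code_page
    return index


_NAME_TO_CODE_PAGE = _build_index(CODE_PAGES)


def code_page_for(name):
    """Return the code page number for a listed name or alias, or None if unlisted."""
    # Assumption of this sketch: names are compared case-insensitively.
    return _NAME_TO_CODE_PAGE.get(name.strip().lower())


def preferred_name(name):
    """Return the preferred name for a listed name or alias, or None if unlisted."""
    code_page = code_page_for(name)
    return None if code_page is None else CODE_PAGES[code_page][0]
```

With this table, `preferred_name("sjis")` yields `shift_jis` and `code_page_for("GBK")` yields 936, while a name that does not appear in the table, such as `cp1252`, yields `None` rather than being guessed at.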

Code Page 874 (ANSI/OEM - Thai)
Preferred Name: windows-874
Aliases: DOS-874, iso-8859-11, TIS-620

Code Page 932 (ANSI/OEM - Japanese Shift-JIS)
Preferred Name: shift_jis
Aliases: csShiftJIS, csWindows31J, ms_Kanji, shift-jis, sjis, x-ms-cp932, x-sjis

Code Page 936 (ANSI/OEM - Simplified Chinese GBK)
Preferred Name: gb2312
Aliases: chinese, CN-GB, csGB2312, csGB231280, csISO58GB231280, GB2312-80, GB231280, GBK, GB_2312-80, iso-ir-58

Code Page 949 (ANSI/OEM - Korean)
Preferred Name: ks_c_5601-1987
Aliases: csKSC56011987, iso-ir-149, korean, ks-c-5601, ks-c5601, KSC5601, KSC_5601, ks_c_5601, ks_c_5601-1989, ks_c_5601_1987

Code Page 950 (ANSI/OEM - Traditional Chinese Big5)
Preferred Name: big5
Aliases: Big5-HKSCS, cn-big5, csbig5, x-x-big5

Code Page 1250 (ANSI - Central Europe)
Preferred Name: windows-1250
Aliases: x-cp1250

Code Page 1251 (ANSI - Cyrillic)
Preferred Name: windows-1251
Aliases: x-cp1251

Code Page 1252 (ANSI - Latin I)
Preferred Name: Windows-1252
Aliases: x-ansi

Code Page 1253 (ANSI - Greek)
Preferred Name: windows-1253

Code Page 1254 (ANSI - Turkish)
Preferred Name: windows-1254

Code Page 1255 (ANSI - Hebrew)
Preferred Name: windows-1255

Code Page 1256 (ANSI - Arabic)
Preferred Name: windows-1256
Aliases: cp1256

Code Page 1257 (ANSI - Baltic)
Preferred Name: windows-1257

Code Page 1258 (ANSI/OEM - Viet Nam)
Preferred Name: windows-1258