I18N::Charset
IANA Character Set Registry names and Unicode::MapUTF8 conversion scheme names


NAME

I18N::Charset - IANA Character Set Registry names and Unicode::MapUTF8 (et al.) conversion scheme names


SYNOPSIS


  use I18N::Charset;

  $sCharset = iana_charset_name('WinCyrillic');
  # $sCharset is now 'windows-1251'

  $sCharset = umap_charset_name('Adobe DingBats');
  # $sCharset is now 'ADOBE-DINGBATS' which can be passed to Unicode::Map->new()

  $sCharset = map8_charset_name('windows-1251');
  # $sCharset is now 'cp1251' which can be passed to Unicode::Map8->new()

  $sCharset = umu8_charset_name('x-sjis');
  # $sCharset is now 'sjis' which can be passed to Unicode::MapUTF8->new()

  $sCharset = libi_charset_name('x-sjis');
  # $sCharset is now 'MS_KANJI' which can be passed to `iconv -f $sCharset ...`

  $sCharset = enco_charset_name('Shift-JIS');
  # $sCharset is now 'shiftjis' which can be passed to `Encode::from_to()`

  I18N::Charset::add_iana_alias('my-japanese' => 'iso-2022-jp');
  I18N::Charset::add_map8_alias('my-arabic' => 'arabic7');
  I18N::Charset::add_umap_alias('my-hebrew' => 'ISO-8859-8');
  I18N::Charset::add_libi_alias('my-sjis' => 'x-sjis');
  I18N::Charset::add_enco_alias('my-japanese' => 'shiftjis');


DESCRIPTION

The I18N::Charset module provides access to the IANA Character Set Registry names for identifying character encoding schemes. It also maps those names to the character set names used by the Unicode::Map, Unicode::Map8, and Unicode::MapUTF8 modules, the iconv library, and the Encode module.

So, for example, if you get an HTML document with a META CHARSET="..." tag, you can fairly quickly determine which Unicode::MapXXX module can be used to convert it to Unicode.

Each family of functions relies on an optional backend. If the backend is not installed, the corresponding functions will always return undef:

  • umap_ functions: the Unicode::Map module
  • map8_ functions: the Unicode::Map8 module
  • umu8_ functions: the Unicode::MapUTF8 module
  • libi_ functions: the iconv library
  • enco_ functions: the Encode module
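As a rough illustration of the META tag scenario above, the following sketch pulls a charset label out of a tag and looks it up. The meta_charset() helper and its regular expression are assumptions made for this example and are not part of I18N::Charset. The module is loaded with require inside eval so that the sketch also runs where it is not installed, and each result is checked for undef before use.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of I18N::Charset): extract the charset
# label from an HTML META tag, or return nothing if there is none.
sub meta_charset {
    my ($html) = @_;
    return $1 if $html =~ /<meta\b[^>]*\bcharset\s*=\s*["']?([\w.:+-]+)/i;
    return;
}

my $html  = '<meta http-equiv="Content-Type" content="text/html; charset=Shift_JIS">';
my $label = meta_charset($html);

if (eval { require I18N::Charset; 1 }) {
    # Both calls may return undef: the first for an unknown name, the
    # second also when Unicode::Map8 is not installed.
    my $iana = I18N::Charset::iana_charset_name($label);
    my $map8 = I18N::Charset::map8_charset_name($label);
    print "IANA name: ", (defined $iana ? $iana : '(unknown)'), "\n";
    print "Map8 name: ", (defined $map8 ? $map8 : '(no mapping available)'), "\n";
}
else {
    print "I18N::Charset is not installed; the label was '$label'\n";
}
```

Testing every result with defined() keeps the caller from passing undef on to a conversion module.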


CONVERSION ROUTINES

The main conversion routines are iana_charset_name(), map8_charset_name(), umap_charset_name(), and umu8_charset_name(). The libi_charset_name() and enco_charset_name() routines shown in the SYNOPSIS are called the same way.

iana_charset_name()
This function takes a string containing the name of a character set and returns a string which contains the official IANA name of the character set identified. If no valid character set name can be identified, then undef will be returned. The case and punctuation within the string are not important.

    $sCharset = iana_charset_name('WinCyrillic');
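To make the remark about case and punctuation concrete, here is a small sketch that resolves several spellings of one name. The resolve_all() helper is hypothetical and exists only for this example. If I18N::Charset is not installed, every entry comes back as undef.

```perl
use strict;
use warnings;

# Hypothetical helper: resolve each name with iana_charset_name().
# Returns one entry per input name; an entry is undef when the name is
# unknown or when I18N::Charset is not installed.
sub resolve_all {
    my @names = @_;
    my $have_module = eval { require I18N::Charset; 1 };
    my @resolved;
    for my $name (@names) {
        push @resolved,
            $have_module ? scalar(I18N::Charset::iana_charset_name($name)) : undef;
    }
    return @resolved;
}

# Spellings that differ only in case and punctuation.
my @spellings = ('Shift_JIS', 'shift-jis', 'SHIFT JIS');
my @resolved  = resolve_all(@spellings);

for my $i (0 .. $#spellings) {
    printf "%-10s => %s\n", $spellings[$i],
        defined $resolved[$i] ? $resolved[$i] : '(undef)';
}
```

With the module installed, all three spellings are expected to resolve to the same registered name.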

umap_charset_name()
This function takes a string containing the name of a character set (in almost any format) and returns a string which contains a name for the character set that can be passed to Unicode::Map->new(). If no valid character set name can be identified, then undef will be returned. The case and punctuation within the argument string are not important.

    $sCharset = umap_charset_name('hebrew');


QUERY ROUTINES

There is one function which can be used to obtain a list of all IANA-registered character set names.

all_iana_charset_names()
Returns a list of all registered IANA character set names. The names are not in any particular order.
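Because the list comes back in no particular order, callers that display it will probably want to sort it first. A minimal sketch, assuming a case-insensitive sort is acceptable; sorted_iana_names() is a hypothetical wrapper, and the function is called by its fully qualified name so the example does not depend on what the module exports.

```perl
use strict;
use warnings;

# Hypothetical wrapper: return the registered names sorted without
# regard to case, or an empty list if I18N::Charset is not installed.
sub sorted_iana_names {
    return () unless eval { require I18N::Charset; 1 };
    my @names = I18N::Charset::all_iana_charset_names();
    return sort { lc($a) cmp lc($b) } @names;
}

my @names = sorted_iana_names();
printf "%d registered names\n", scalar @names;

# Show the first few names only.
my $last = $#names < 4 ? $#names : 4;
print "  $_\n" for @names[0 .. $last];
```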


CHARACTER SET NAME ALIASING

This module supports several semi-private routines for specifying character set name aliases.

add_iana_alias()
This function takes two strings: a new alias, and a target IANA Character Set Name (or another alias). It defines the new alias to refer to that character set name (or to the character set name to which the second alias refers).

Returns the target character set name of the successfully installed alias. Returns undef if the target character set name is not registered, or if the second argument is an alias whose target character set name is not registered.


  I18N::Charset::add_iana_alias('my-alias1' => 'Shift_JIS');

With this code, "my-alias1" becomes an alias for the existing IANA character set name 'Shift_JIS'.


  I18N::Charset::add_iana_alias('my-alias2' => 'sjis');

With this code, "my-alias2" becomes an alias for the IANA character set name referred to by the existing alias 'sjis' (which happens to be 'Shift_JIS').
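Since add_iana_alias() signals failure by returning undef, it is worth checking the return value before relying on a new alias. The following is only a sketch: install_alias() is a hypothetical wrapper, and 'no-such-charset' is a made-up name chosen because it should not be registered. Without I18N::Charset installed, both aliases are reported as not installed.

```perl
use strict;
use warnings;

# Hypothetical wrapper: install an IANA alias and return the target
# character set name, or undef on failure (including the case where
# I18N::Charset itself is not installed).
sub install_alias {
    my ($alias, $target) = @_;
    return unless eval { require I18N::Charset; 1 };
    return scalar I18N::Charset::add_iana_alias($alias => $target);
}

for my $pair (['my-alias1' => 'Shift_JIS'], ['my-alias3' => 'no-such-charset']) {
    my ($alias, $target) = @$pair;
    my $result = install_alias($alias, $target);
    print defined $result
        ? "$alias now refers to $result\n"
        : "$alias was not installed\n";
}
```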

add_map8_alias()
This function takes two strings: a new alias, and a target Unicode::Map8 Character Set Name (or an existing alias to a Map8 name). It defines the new alias to refer to that mapping name (or to the mapping name to which the second alias refers).

If the first argument is a registered IANA character set name, then all aliases of that IANA character set name will end up pointing to the target Map8 mapping name.

Returns the target mapping name of the successfully installed alias. Returns undef if the target mapping name is not registered, or if the second argument is an alias whose target mapping name is not registered.


  I18N::Charset::add_map8_alias('normal' => 'ANSI_X3.4-1968');

With the above statement, "normal" becomes an alias for the existing Unicode::Map8 mapping name 'ANSI_X3.4-1968'.


  I18N::Charset::add_map8_alias('normal' => 'US-ASCII');

With the above statement, "normal" becomes an alias for the existing Unicode::Map8 mapping name 'ANSI_X3.4-1968' (which is what "US-ASCII" is an alias for).


  I18N::Charset::add_map8_alias('IBM297' => 'EBCDIC-CA-FR');

With the above statement, "IBM297" becomes an alias for the existing Unicode::Map8 mapping name 'EBCDIC-CA-FR'. As a side effect, all the aliases for 'IBM297' (i.e. 'cp297' and 'ebcdic-cp-fr') also become aliases for 'EBCDIC-CA-FR'.

add_umap_alias()
This function works identically to add_map8_alias() above, but operates on Unicode::Map encoding tables.

add_libi_alias()
This function takes two strings: a new alias, and a target iconv Character Set Name (or existing iconv alias). It defines the new alias to refer to that character set name (or to the character set name to which the existing alias refers).

Returns the target conversion scheme name of the successfully installed alias. Returns undef if there is no such target conversion scheme or alias.

Examples:


  I18N::Charset::add_libi_alias('my-chinese1' => 'CN-GB');

With this code, "my-chinese1" becomes an alias for the existing iconv conversion scheme 'CN-GB'.


  I18N::Charset::add_libi_alias('my-chinese2' => 'EUC-CN');

With this code, "my-chinese2" becomes an alias for the iconv conversion scheme referred to by the existing alias 'EUC-CN' (which happens to be 'CN-GB').

add_enco_alias()
This function takes two strings: a new alias, and a target Encode encoding name (or existing Encode alias). It defines the new alias to refer to that encoding name (or to the encoding to which the existing alias refers).

Returns the target encoding name of the successfully installed alias. Returns undef if there is no such encoding or alias.

Examples:


  I18N::Charset::add_enco_alias('my-japanese1' => 'jis0201-raw');

With this code, "my-japanese1" becomes an alias for the existing encoding 'jis0201-raw'.


  I18N::Charset::add_enco_alias('my-japanese2' => 'my-japanese1');

With this code, "my-japanese2" becomes an alias for the encoding referred to by the existing alias 'my-japanese1' (which happens to be 'jis0201-raw' after the previous call).


KNOWN BUGS AND LIMITATIONS

  • There could probably be many more aliases added (for convenience) to all the IANA names. If you have some specific recommendations, please email the author!

  • The only character set names which have a corresponding mapping in the Unicode::Map8 module are the character sets that Unicode::Map8 can convert.

    Similarly, the only character set names which have a corresponding mapping in the Unicode::Map module are the character sets that Unicode::Map can convert.

  • In the current implementation, all tables are read in and initialized when the module is loaded, and then held in memory until the program exits. A "lazy" implementation (or a less-portable tied hash) might lead to a shorter startup time. Suggestions, patches, comments are always welcome!


SEE ALSO

Unicode::Map
Convert strings from various multi-byte character encodings to and from Unicode.

Unicode::Map8
Convert strings from various 8-bit character encodings to and from Unicode.

Jcode
Convert strings among various Japanese character encodings and Unicode.

Unicode::MapUTF8
A wrapper around all three of these character set conversion distributions.


AUTHOR

Martin Thurn, mthurn@cpan.org, http://tinyurl.com/nn67z.


COPYRIGHT

Copyright (c) 1998-2006 Martin Thurn

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

IANA CHARACTER SET REGISTRY

(last updated 2007-05-14)

These are the official names for character sets that may be used in the Internet and may be referred to in Internet documentation. These names are expressed in ANSI_X3.4-1968, which is commonly called US-ASCII or simply ASCII. The character set most commonly used in the Internet, and used especially in protocol standards, is US-ASCII; its use is strongly encouraged. The use of the name US-ASCII is also encouraged.

The character set names may be up to 40 characters taken from the printable characters of US-ASCII. However, no distinction is made between use of upper and lower case letters.
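The naming rule above can be turned into a simple sanity check. This is a sketch under stated assumptions, not a check performed by I18N::Charset: both function names are hypothetical, and "printable" is taken to mean the bytes 0x21 through 0x7E, so the space character is excluded.

```perl
use strict;
use warnings;

# Hypothetical check: 1 to 40 printable US-ASCII characters.
# Assumption: "printable" means 0x21-0x7E, which excludes the space.
sub is_valid_charset_name {
    my ($name) = @_;
    return 0 unless defined $name;
    return $name =~ /\A[\x21-\x7E]{1,40}\z/ ? 1 : 0;
}

# Hypothetical comparison: names match without regard to letter case.
sub same_charset_name {
    my ($first, $second) = @_;
    return lc($first) eq lc($second) ? 1 : 0;
}

for my $name ('US-ASCII', 'Extended_UNIX_Code_Packed_Format_for_Japanese') {
    printf "%-46s length %2d, valid: %d\n",
        $name, length $name, is_valid_charset_name($name);
}
print "us-ascii and US-ASCII match: ", same_charset_name('us-ascii', 'US-ASCII'), "\n";
```

One registered name in the list below, Extended_UNIX_Code_Packed_Format_for_Japanese, is longer than 40 characters, so a strict check like this is better used as a warning than as a hard filter.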

The MIBenum value is a unique value for use in MIBs to identify coded character sets.

The value space for MIBenum values has been divided into three regions. The first region (3-999) consists of coded character sets that have been standardized by some standard setting organization. This region is intended for standards that do not have subset implementations. The second region (1000-1999) is for the Unicode and ISO/IEC 10646 coded character sets together with a specification of a (set of) sub-repertoires that may occur. The third region (>1999) is intended for vendor specific coded character sets.


        Assigned MIB enum Numbers
        -------------------------
        0-2             Reserved
        3-999           Set By Standards Organizations
        1000-1999       Unicode / 10646
        2000-2999       Vendor
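The regions can be expressed as a small classifier. This is a minimal sketch: mibenum_region() and its return strings are invented for the example. It follows the prose and treats every value above 1999 as vendor-specific, although the table lists only 2000-2999.

```perl
use strict;
use warnings;

# Hypothetical classifier for the MIBenum regions described above.
sub mibenum_region {
    my ($n) = @_;
    return 'invalid'   unless defined $n && $n =~ /\A\d+\z/;
    return 'reserved'  if $n <= 2;       # 0-2
    return 'standards' if $n <= 999;     # 3-999
    return 'unicode'   if $n <= 1999;    # 1000-1999
    return 'vendor';                     # above 1999
}

# MIBenum values taken from the registry entries below:
# ANSI_X3.4-1968, UTF-8, UTF-16, and IBM850.
for my $n (3, 106, 1015, 2009) {
    printf "%4d => %s\n", $n, mibenum_region($n);
}
```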

The aliases that start with "cs" have been added for use with the IANA-CHARSET-MIB as originally defined in RFC3808, and as currently maintained by IANA at http://www.iana.org/assignments/ianacharset-mib. Note that the ianacharset-mib needs to be kept in sync with this registry. These aliases that start with "cs" contain the standard numbers along with suggestive names in order to facilitate applications that want to display the names in user interfaces. The "cs" stands for character set and is provided for applications that need a lower case first letter but want to use mixed case thereafter that cannot contain any special characters, such as underbar ("_") and dash ("-").

If the character set is from an ISO standard, its cs alias is the ISO standard number or name. If the character set is not from an ISO standard, but is registered with ISO (IPSJ/ITSCJ is the current ISO Registration Authority), the ISO Registry number is specified as ISOnnn followed by letters suggestive of the name or standards number of the code set. When a national or international standard is revised, the year of revision is added to the cs alias of the new character set entry in the IANA Registry in order to distinguish the revised character set from the original character set.
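Each registry entry in the list that follows is a run of "Name:", "MIBenum:", "Source:", and "Alias:" fields. As a rough sketch of how such an entry could be split apart, the hypothetical parse_entry() below extracts the name, the MIBenum, the aliases, and the preferred MIME name when one is marked. It assumes the one-entry-per-line layout used here and is not how I18N::Charset reads its own tables.

```perl
use strict;
use warnings;

# Hypothetical parser for one registry entry written on a single line.
# Returns a hash reference, or nothing if the line has no Name field.
sub parse_entry {
    my ($line) = @_;
    my %entry = (aliases => []);

    # The name is the first token after "Name:".
    ($entry{name}) = $line =~ /^Name:\s*(\S+)/ or return;

    # MIBenum is optional; it stays undef when the entry lists none.
    ($entry{mibenum}) = $line =~ /\bMIBenum:\s*(\d+)/;

    # Collect every alias, skipping the placeholder "None".
    while ($line =~ /\bAlias:\s*(\S+)(\s+\(preferred MIME name\))?/g) {
        next if $1 eq 'None';
        push @{ $entry{aliases} }, $1;
        $entry{mime} = $1 if defined $2;
    }

    # The name itself may carry the "preferred MIME name" marker.
    $entry{mime} = $entry{name}
        if $line =~ /^Name:\s*\S+\s+\(preferred MIME name\)/;

    return \%entry;
}

my $entry = parse_entry(
    'Name: ISO_8859-8:1988 [RFC1345,KXS2] MIBenum: 11 Source: ECMA registry '
  . 'Alias: iso-ir-138 Alias: ISO_8859-8 Alias: ISO-8859-8 (preferred MIME name) '
  . 'Alias: hebrew Alias: csISOLatinHebrew'
);
printf "%s (MIBenum %s), MIME name %s, aliases: %s\n",
    $entry->{name}, $entry->{mibenum}, $entry->{mime},
    join(', ', @{ $entry->{aliases} });
```

The Source field is left unparsed because its free-form text varies from entry to entry.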




Character Set                                               Reference
-------------                                               ---------

Name: ANSI_X3.4-1968 [RFC1345,KXS2] MIBenum: 3 Source: ECMA registry Alias: iso-ir-6 Alias: ANSI_X3.4-1986 Alias: ISO_646.irv:1991 Alias: ASCII Alias: ISO646-US Alias: US-ASCII (preferred MIME name) Alias: us Alias: IBM367 Alias: cp367 Alias: csASCII

Name: ISO_8859-1:1987 [RFC1345,KXS2] MIBenum: 4 Source: ECMA registry Alias: iso-ir-100 Alias: ISO_8859-1 Alias: ISO-8859-1 (preferred MIME name) Alias: latin1 Alias: l1 Alias: IBM819 Alias: CP819 Alias: csISOLatin1

Name: ISO_8859-2:1987 [RFC1345,KXS2] MIBenum: 5 Source: ECMA registry Alias: iso-ir-101 Alias: ISO_8859-2 Alias: ISO-8859-2 (preferred MIME name) Alias: latin2 Alias: l2 Alias: csISOLatin2

Name: ISO_8859-3:1988 [RFC1345,KXS2] MIBenum: 6 Source: ECMA registry Alias: iso-ir-109 Alias: ISO_8859-3 Alias: ISO-8859-3 (preferred MIME name) Alias: latin3 Alias: l3 Alias: csISOLatin3

Name: ISO_8859-4:1988 [RFC1345,KXS2] MIBenum: 7 Source: ECMA registry Alias: iso-ir-110 Alias: ISO_8859-4 Alias: ISO-8859-4 (preferred MIME name) Alias: latin4 Alias: l4 Alias: csISOLatin4

Name: ISO_8859-5:1988 [RFC1345,KXS2] MIBenum: 8 Source: ECMA registry Alias: iso-ir-144 Alias: ISO_8859-5 Alias: ISO-8859-5 (preferred MIME name) Alias: cyrillic Alias: csISOLatinCyrillic

Name: ISO_8859-6:1987 [RFC1345,KXS2] MIBenum: 9 Source: ECMA registry Alias: iso-ir-127 Alias: ISO_8859-6 Alias: ISO-8859-6 (preferred MIME name) Alias: ECMA-114 Alias: ASMO-708 Alias: arabic Alias: csISOLatinArabic

Name: ISO_8859-7:1987 [RFC1947,RFC1345,KXS2] MIBenum: 10 Source: ECMA registry Alias: iso-ir-126 Alias: ISO_8859-7 Alias: ISO-8859-7 (preferred MIME name) Alias: ELOT_928 Alias: ECMA-118 Alias: greek Alias: greek8 Alias: csISOLatinGreek

Name: ISO_8859-8:1988 [RFC1345,KXS2] MIBenum: 11 Source: ECMA registry Alias: iso-ir-138 Alias: ISO_8859-8 Alias: ISO-8859-8 (preferred MIME name) Alias: hebrew Alias: csISOLatinHebrew

Name: ISO_8859-9:1989 [RFC1345,KXS2] MIBenum: 12 Source: ECMA registry Alias: iso-ir-148 Alias: ISO_8859-9 Alias: ISO-8859-9 (preferred MIME name) Alias: latin5 Alias: l5 Alias: csISOLatin5

Name: ISO-8859-10 (preferred MIME name) [RFC1345,KXS2] MIBenum: 13 Source: ECMA registry Alias: iso-ir-157 Alias: l6 Alias: ISO_8859-10:1992 Alias: csISOLatin6 Alias: latin6

Name: ISO_6937-2-add [RFC1345,KXS2] MIBenum: 14 Source: ECMA registry and ISO 6937-2:1983 Alias: iso-ir-142 Alias: csISOTextComm

Name: JIS_X0201 [RFC1345,KXS2] MIBenum: 15 Source: JIS X 0201-1976. One byte only, this is equivalent to JIS/Roman (similar to ASCII) plus eight-bit half-width Katakana Alias: X0201 Alias: csHalfWidthKatakana

Name: JIS_Encoding MIBenum: 16 Source: JIS X 0202-1991. Uses ISO 2022 escape sequences to shift code sets as documented in JIS X 0202-1991. Alias: csJISEncoding

Name: Shift_JIS (preferred MIME name) MIBenum: 17 Source: This charset is an extension of csHalfWidthKatakana by adding graphic characters in JIS X 0208. The CCS's are JIS X0201:1997 and JIS X0208:1997. The complete definition is shown in Appendix 1 of JIS X0208:1997. This charset can be used for the top-level media type ``text''. Alias: MS_Kanji Alias: csShiftJIS

Name: Extended_UNIX_Code_Packed_Format_for_Japanese MIBenum: 18 Source: Standardized by OSF, UNIX International, and UNIX Systems Laboratories Pacific. Uses ISO 2022 rules to select code set 0: US-ASCII (a single 7-bit byte set) code set 1: JIS X0208-1990 (a double 8-bit byte set) restricted to A0-FF in both bytes code set 2: Half Width Katakana (a single 7-bit byte set) requiring SS2 as the character prefix code set 3: JIS X0212-1990 (a double 7-bit byte set) restricted to A0-FF in both bytes requiring SS3 as the character prefix Alias: csEUCPkdFmtJapanese Alias: EUC-JP (preferred MIME name)

Name: Extended_UNIX_Code_Fixed_Width_for_Japanese MIBenum: 19 Source: Used in Japan. Each character is 2 octets. code set 0: US-ASCII (a single 7-bit byte set) 1st byte = 00 2nd byte = 20-7E code set 1: JIS X0208-1990 (a double 7-bit byte set) restricted to A0-FF in both bytes code set 2: Half Width Katakana (a single 7-bit byte set) 1st byte = 00 2nd byte = A0-FF code set 3: JIS X0212-1990 (a double 7-bit byte set) restricted to A0-FF in the first byte and 21-7E in the second byte Alias: csEUCFixWidJapanese

Name: BS_4730 [RFC1345,KXS2] MIBenum: 20 Source: ECMA registry Alias: iso-ir-4 Alias: ISO646-GB Alias: gb Alias: uk Alias: csISO4UnitedKingdom

Name: SEN_850200_C [RFC1345,KXS2] MIBenum: 21 Source: ECMA registry Alias: iso-ir-11 Alias: ISO646-SE2 Alias: se2 Alias: csISO11SwedishForNames

Name: IT [RFC1345,KXS2] MIBenum: 22 Source: ECMA registry Alias: iso-ir-15 Alias: ISO646-IT Alias: csISO15Italian

Name: ES [RFC1345,KXS2] MIBenum: 23 Source: ECMA registry Alias: iso-ir-17 Alias: ISO646-ES Alias: csISO17Spanish

Name: DIN_66003 [RFC1345,KXS2] MIBenum: 24 Source: ECMA registry Alias: iso-ir-21 Alias: de Alias: ISO646-DE Alias: csISO21German

Name: NS_4551-1 [RFC1345,KXS2] MIBenum: 25 Source: ECMA registry Alias: iso-ir-60 Alias: ISO646-NO Alias: no Alias: csISO60DanishNorwegian Alias: csISO60Norwegian1

Name: NF_Z_62-010 [RFC1345,KXS2] MIBenum: 26 Source: ECMA registry Alias: iso-ir-69 Alias: ISO646-FR Alias: fr Alias: csISO69French

Name: ISO-10646-UTF-1 MIBenum: 27 Source: Universal Transfer Format (1), this is the multibyte encoding, that subsets ASCII-7. It does not have byte ordering issues. Alias: csISO10646UTF1

Name: ISO_646.basic:1983 [RFC1345,KXS2] MIBenum: 28 Source: ECMA registry Alias: ref Alias: csISO646basic1983

Name: INVARIANT [RFC1345,KXS2] MIBenum: 29 Alias: csINVARIANT

Name: ISO_646.irv:1983 [RFC1345,KXS2] MIBenum: 30 Source: ECMA registry Alias: iso-ir-2 Alias: irv Alias: csISO2IntlRefVersion

Name: NATS-SEFI [RFC1345,KXS2] MIBenum: 31 Source: ECMA registry Alias: iso-ir-8-1 Alias: csNATSSEFI

Name: NATS-SEFI-ADD [RFC1345,KXS2] MIBenum: 32 Source: ECMA registry Alias: iso-ir-8-2 Alias: csNATSSEFIADD

Name: NATS-DANO [RFC1345,KXS2] MIBenum: 33 Source: ECMA registry Alias: iso-ir-9-1 Alias: csNATSDANO

Name: NATS-DANO-ADD [RFC1345,KXS2] MIBenum: 34 Source: ECMA registry Alias: iso-ir-9-2 Alias: csNATSDANOADD

Name: SEN_850200_B [RFC1345,KXS2] MIBenum: 35 Source: ECMA registry Alias: iso-ir-10 Alias: FI Alias: ISO646-FI Alias: ISO646-SE Alias: se Alias: csISO10Swedish

Name: KS_C_5601-1987 [RFC1345,KXS2] MIBenum: 36 Source: ECMA registry Alias: iso-ir-149 Alias: KS_C_5601-1989 Alias: KSC_5601 Alias: korean Alias: csKSC56011987

Name: ISO-2022-KR (preferred MIME name) [RFC1557,Choi] MIBenum: 37 Source: RFC-1557 (see also KS_C_5601-1987) Alias: csISO2022KR

Name: EUC-KR (preferred MIME name) [RFC1557,Choi] MIBenum: 38 Source: RFC-1557 (see also KS_C_5861-1992) Alias: csEUCKR

Name: ISO-2022-JP (preferred MIME name) [RFC1468,Murai] MIBenum: 39 Source: RFC-1468 (see also RFC-2237) Alias: csISO2022JP

Name: ISO-2022-JP-2 (preferred MIME name) [RFC1554,Ohta] MIBenum: 40 Source: RFC-1554 Alias: csISO2022JP2

Name: JIS_C6220-1969-jp [RFC1345,KXS2] MIBenum: 41 Source: ECMA registry Alias: JIS_C6220-1969 Alias: iso-ir-13 Alias: katakana Alias: x0201-7 Alias: csISO13JISC6220jp

Name: JIS_C6220-1969-ro [RFC1345,KXS2] MIBenum: 42 Source: ECMA registry Alias: iso-ir-14 Alias: jp Alias: ISO646-JP Alias: csISO14JISC6220ro

Name: PT [RFC1345,KXS2] MIBenum: 43 Source: ECMA registry Alias: iso-ir-16 Alias: ISO646-PT Alias: csISO16Portuguese

Name: greek7-old [RFC1345,KXS2] MIBenum: 44 Source: ECMA registry Alias: iso-ir-18 Alias: csISO18Greek7Old

Name: latin-greek [RFC1345,KXS2] MIBenum: 45 Source: ECMA registry Alias: iso-ir-19 Alias: csISO19LatinGreek

Name: NF_Z_62-010_(1973) [RFC1345,KXS2] MIBenum: 46 Source: ECMA registry Alias: iso-ir-25 Alias: ISO646-FR1 Alias: csISO25French

Name: Latin-greek-1 [RFC1345,KXS2] MIBenum: 47 Source: ECMA registry Alias: iso-ir-27 Alias: csISO27LatinGreek1

Name: ISO_5427 [RFC1345,KXS2] MIBenum: 48 Source: ECMA registry Alias: iso-ir-37 Alias: csISO5427Cyrillic

Name: JIS_C6226-1978 [RFC1345,KXS2] MIBenum: 49 Source: ECMA registry Alias: iso-ir-42 Alias: csISO42JISC62261978

Name: BS_viewdata [RFC1345,KXS2] MIBenum: 50 Source: ECMA registry Alias: iso-ir-47 Alias: csISO47BSViewdata

Name: INIS [RFC1345,KXS2] MIBenum: 51 Source: ECMA registry Alias: iso-ir-49 Alias: csISO49INIS

Name: INIS-8 [RFC1345,KXS2] MIBenum: 52 Source: ECMA registry Alias: iso-ir-50 Alias: csISO50INIS8

Name: INIS-cyrillic [RFC1345,KXS2] MIBenum: 53 Source: ECMA registry Alias: iso-ir-51 Alias: csISO51INISCyrillic

Name: ISO_5427:1981 [RFC1345,KXS2] MIBenum: 54 Source: ECMA registry Alias: iso-ir-54 Alias: ISO5427Cyrillic1981

Name: ISO_5428:1980 [RFC1345,KXS2] MIBenum: 55 Source: ECMA registry Alias: iso-ir-55 Alias: csISO5428Greek

Name: GB_1988-80 [RFC1345,KXS2] MIBenum: 56 Source: ECMA registry Alias: iso-ir-57 Alias: cn Alias: ISO646-CN Alias: csISO57GB1988

Name: GB_2312-80 [RFC1345,KXS2] MIBenum: 57 Source: ECMA registry Alias: iso-ir-58 Alias: chinese Alias: csISO58GB231280

Name: NS_4551-2 [RFC1345,KXS2] MIBenum: 58 Source: ECMA registry Alias: ISO646-NO2 Alias: iso-ir-61 Alias: no2 Alias: csISO61Norwegian2

Name: videotex-suppl [RFC1345,KXS2] MIBenum: 59 Source: ECMA registry Alias: iso-ir-70 Alias: csISO70VideotexSupp1

Name: PT2 [RFC1345,KXS2] MIBenum: 60 Source: ECMA registry Alias: iso-ir-84 Alias: ISO646-PT2 Alias: csISO84Portuguese2

Name: ES2 [RFC1345,KXS2] MIBenum: 61 Source: ECMA registry Alias: iso-ir-85 Alias: ISO646-ES2 Alias: csISO85Spanish2

Name: MSZ_7795.3 [RFC1345,KXS2] MIBenum: 62 Source: ECMA registry Alias: iso-ir-86 Alias: ISO646-HU Alias: hu Alias: csISO86Hungarian

Name: JIS_C6226-1983 [RFC1345,KXS2] MIBenum: 63 Source: ECMA registry Alias: iso-ir-87 Alias: x0208 Alias: JIS_X0208-1983 Alias: csISO87JISX0208

Name: greek7 [RFC1345,KXS2] MIBenum: 64 Source: ECMA registry Alias: iso-ir-88 Alias: csISO88Greek7

Name: ASMO_449 [RFC1345,KXS2] MIBenum: 65 Source: ECMA registry Alias: ISO_9036 Alias: arabic7 Alias: iso-ir-89 Alias: csISO89ASMO449

Name: iso-ir-90 [RFC1345,KXS2] MIBenum: 66 Source: ECMA registry Alias: csISO90

Name: JIS_C6229-1984-a [RFC1345,KXS2] MIBenum: 67 Source: ECMA registry Alias: iso-ir-91 Alias: jp-ocr-a Alias: csISO91JISC62291984a

Name: JIS_C6229-1984-b [RFC1345,KXS2] MIBenum: 68 Source: ECMA registry Alias: iso-ir-92 Alias: ISO646-JP-OCR-B Alias: jp-ocr-b Alias: csISO92JISC62991984b

Name: JIS_C6229-1984-b-add [RFC1345,KXS2] MIBenum: 69 Source: ECMA registry Alias: iso-ir-93 Alias: jp-ocr-b-add Alias: csISO93JIS62291984badd

Name: JIS_C6229-1984-hand [RFC1345,KXS2] MIBenum: 70 Source: ECMA registry Alias: iso-ir-94 Alias: jp-ocr-hand Alias: csISO94JIS62291984hand

Name: JIS_C6229-1984-hand-add [RFC1345,KXS2] MIBenum: 71 Source: ECMA registry Alias: iso-ir-95 Alias: jp-ocr-hand-add Alias: csISO95JIS62291984handadd

Name: JIS_C6229-1984-kana [RFC1345,KXS2] MIBenum: 72 Source: ECMA registry Alias: iso-ir-96 Alias: csISO96JISC62291984kana

Name: ISO_2033-1983 [RFC1345,KXS2] MIBenum: 73 Source: ECMA registry Alias: iso-ir-98 Alias: e13b Alias: csISO2033

Name: ANSI_X3.110-1983 [RFC1345,KXS2] MIBenum: 74 Source: ECMA registry Alias: iso-ir-99 Alias: CSA_T500-1983 Alias: NAPLPS Alias: csISO99NAPLPS

Name: T.61-7bit [RFC1345,KXS2] MIBenum: 75 Source: ECMA registry Alias: iso-ir-102 Alias: csISO102T617bit

Name: T.61-8bit [RFC1345,KXS2] MIBenum: 76 Alias: T.61 Source: ECMA registry Alias: iso-ir-103 Alias: csISO103T618bit

Name: ECMA-cyrillic MIBenum: 77 Source: ISO registry (formerly ECMA registry) http://www.itscj.ipsj.jp/ISO-IR/111.pdf Alias: iso-ir-111 Alias: KOI8-E Alias: csISO111ECMACyrillic

Name: CSA_Z243.4-1985-1 [RFC1345,KXS2] MIBenum: 78 Source: ECMA registry Alias: iso-ir-121 Alias: ISO646-CA Alias: csa7-1 Alias: ca Alias: csISO121Canadian1

Name: CSA_Z243.4-1985-2 [RFC1345,KXS2] MIBenum: 79 Source: ECMA registry Alias: iso-ir-122 Alias: ISO646-CA2 Alias: csa7-2 Alias: csISO122Canadian2

Name: CSA_Z243.4-1985-gr [RFC1345,KXS2] MIBenum: 80 Source: ECMA registry Alias: iso-ir-123 Alias: csISO123CSAZ24341985gr

Name: ISO_8859-6-E [RFC1556,IANA] MIBenum: 81 Source: RFC1556 Alias: csISO88596E Alias: ISO-8859-6-E (preferred MIME name)

Name: ISO_8859-6-I [RFC1556,IANA] MIBenum: 82 Source: RFC1556 Alias: csISO88596I Alias: ISO-8859-6-I (preferred MIME name)

Name: T.101-G2 [RFC1345,KXS2] MIBenum: 83 Source: ECMA registry Alias: iso-ir-128 Alias: csISO128T101G2

Name: ISO_8859-8-E [RFC1556,Nussbacher] MIBenum: 84 Source: RFC1556 Alias: csISO88598E Alias: ISO-8859-8-E (preferred MIME name)

Name: ISO_8859-8-I [RFC1556,Nussbacher] MIBenum: 85 Source: RFC1556 Alias: csISO88598I Alias: ISO-8859-8-I (preferred MIME name)

Name: CSN_369103 [RFC1345,KXS2] MIBenum: 86 Source: ECMA registry Alias: iso-ir-139 Alias: csISO139CSN369103

Name: JUS_I.B1.002 [RFC1345,KXS2] MIBenum: 87 Source: ECMA registry Alias: iso-ir-141 Alias: ISO646-YU Alias: js Alias: yu Alias: csISO141JUSIB1002

Name: IEC_P27-1 [RFC1345,KXS2] MIBenum: 88 Source: ECMA registry Alias: iso-ir-143 Alias: csISO143IECP271

Name: JUS_I.B1.003-serb [RFC1345,KXS2] MIBenum: 89 Source: ECMA registry Alias: iso-ir-146 Alias: serbian Alias: csISO146Serbian

Name: JUS_I.B1.003-mac [RFC1345,KXS2] MIBenum: 90 Source: ECMA registry Alias: macedonian Alias: iso-ir-147 Alias: csISO147Macedonian

Name: greek-ccitt [RFC1345,KXS2] MIBenum: 91 Source: ECMA registry Alias: iso-ir-150 Alias: csISO150 Alias: csISO150GreekCCITT

Name: NC_NC00-10:81 [RFC1345,KXS2] MIBenum: 92 Source: ECMA registry Alias: cuba Alias: iso-ir-151 Alias: ISO646-CU Alias: csISO151Cuba

Name: ISO_6937-2-25 [RFC1345,KXS2] MIBenum: 93 Source: ECMA registry Alias: iso-ir-152 Alias: csISO6937Add

Name: GOST_19768-74 [RFC1345,KXS2] MIBenum: 94 Source: ECMA registry Alias: ST_SEV_358-88 Alias: iso-ir-153 Alias: csISO153GOST1976874

Name: ISO_8859-supp [RFC1345,KXS2] MIBenum: 95 Source: ECMA registry Alias: iso-ir-154 Alias: latin1-2-5 Alias: csISO8859Supp

Name: ISO_10367-box [RFC1345,KXS2] MIBenum: 96 Source: ECMA registry Alias: iso-ir-155 Alias: csISO10367Box

Name: latin-lap [RFC1345,KXS2] MIBenum: 97 Source: ECMA registry Alias: lap Alias: iso-ir-158 Alias: csISO158Lap

Name: JIS_X0212-1990 [RFC1345,KXS2] MIBenum: 98 Source: ECMA registry Alias: x0212 Alias: iso-ir-159 Alias: csISO159JISX02121990

Name: DS_2089 [RFC1345,KXS2] MIBenum: 99 Source: Danish Standard, DS 2089, February 1974 Alias: DS2089 Alias: ISO646-DK Alias: dk Alias: csISO646Danish

Name: us-dk [RFC1345,KXS2] MIBenum: 100 Alias: csUSDK

Name: dk-us [RFC1345,KXS2] MIBenum: 101 Alias: csDKUS

Name: KSC5636 [RFC1345,KXS2] MIBenum: 102 Alias: ISO646-KR Alias: csKSC5636

Name: UNICODE-1-1-UTF-7 [RFC1642] MIBenum: 103 Source: RFC 1642 Alias: csUnicode11UTF7

Name: ISO-2022-CN [RFC1922] MIBenum: 104 Source: RFC-1922

Name: ISO-2022-CN-EXT [RFC1922] MIBenum: 105 Source: RFC-1922

Name: UTF-8 [RFC3629] MIBenum: 106 Source: RFC 3629 Alias: None

Name: ISO-8859-13 MIBenum: 109 Source: ISO See (http://www.iana.org/assignments/charset-reg/ISO-8859-13)[Tumasonis] Alias: None

Name: ISO-8859-14 MIBenum: 110 Source: ISO See (http://www.iana.org/assignments/charset-reg/ISO-8859-14) [Simonsen] Alias: iso-ir-199 Alias: ISO_8859-14:1998 Alias: ISO_8859-14 Alias: latin8 Alias: iso-celtic Alias: l8

Name: ISO-8859-15 MIBenum: 111 Source: ISO Please see: <http://www.iana.org/assignments/charset-reg/ISO-8859-15> Alias: ISO_8859-15 Alias: Latin-9

Name: ISO-8859-16 MIBenum: 112 Source: ISO Alias: iso-ir-226 Alias: ISO_8859-16:2001 Alias: ISO_8859-16 Alias: latin10 Alias: l10

Name: GBK MIBenum: 113 Source: Chinese IT Standardization Technical Committee Please see: <http://www.iana.org/assignments/charset-reg/GBK> Alias: CP936 Alias: MS936 Alias: windows-936

Name: GB18030 MIBenum: 114 Source: Chinese IT Standardization Technical Committee Please see: <http://www.iana.org/assignments/charset-reg/GB18030> Alias: None

Name: OSD_EBCDIC_DF04_15 MIBenum: 115 Source: Fujitsu-Siemens standard mainframe EBCDIC encoding Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF04-15> Alias: None

Name: OSD_EBCDIC_DF03_IRV MIBenum: 116 Source: Fujitsu-Siemens standard mainframe EBCDIC encoding Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF03-IRV> Alias: None

Name: OSD_EBCDIC_DF04_1 MIBenum: 117 Source: Fujitsu-Siemens standard mainframe EBCDIC encoding Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF04-1> Alias: None

Name: ISO-11548-1 MIBenum: 118 Source: See <http://www.iana.org/assignments/charset-reg/ISO-11548-1> [Thibault] Alias: ISO_11548-1 Alias: ISO_TR_11548-1 Alias: csISO115481

Name: KZ-1048 MIBenum: 119 Source: See <http://www.iana.org/assignments/charset-reg/KZ-1048> [Veremeev, Kikkarin] Alias: STRK1048-2002 Alias: RK1048 Alias: csKZ1048

Name: ISO-10646-UCS-2 MIBenum: 1000 Source: the 2-octet Basic Multilingual Plane, aka Unicode. This needs to specify network byte order: the standard does not specify (it is a 16-bit integer space) Alias: csUnicode

Name: ISO-10646-UCS-4 MIBenum: 1001 Source: the full code space. (same comment about byte order, these are 31-bit numbers.) Alias: csUCS4

Name: ISO-10646-UCS-Basic MIBenum: 1002 Source: ASCII subset of Unicode. Basic Latin = collection 1 See ISO 10646, Appendix A Alias: csUnicodeASCII

Name: ISO-10646-Unicode-Latin1 MIBenum: 1003 Source: ISO Latin-1 subset of Unicode. Basic Latin and Latin-1 Supplement = collections 1 and 2. See ISO 10646, Appendix A. See RFC 1815. Alias: csUnicodeLatin1 Alias: ISO-10646

Name: ISO-10646-J-1 Source: ISO 10646 Japanese, see RFC 1815.

Name: ISO-Unicode-IBM-1261 MIBenum: 1005 Source: IBM Latin-2, -3, -5, Extended Presentation Set, GCSGID: 1261 Alias: csUnicodeIBM1261

Name: ISO-Unicode-IBM-1268 MIBenum: 1006 Source: IBM Latin-4 Extended Presentation Set, GCSGID: 1268 Alias: csUnicodeIBM1268

Name: ISO-Unicode-IBM-1276 MIBenum: 1007 Source: IBM Cyrillic Greek Extended Presentation Set, GCSGID: 1276 Alias: csUnicodeIBM1276

Name: ISO-Unicode-IBM-1264 MIBenum: 1008 Source: IBM Arabic Presentation Set, GCSGID: 1264 Alias: csUnicodeIBM1264

Name: ISO-Unicode-IBM-1265 MIBenum: 1009 Source: IBM Hebrew Presentation Set, GCSGID: 1265 Alias: csUnicodeIBM1265

Name: UNICODE-1-1 [RFC1641] MIBenum: 1010 Source: RFC 1641 Alias: csUnicode11

Name: SCSU MIBenum: 1011 Source: SCSU See (http://www.iana.org/assignments/charset-reg/SCSU) [Scherer] Alias: None

Name: UTF-7 [RFC2152] MIBenum: 1012 Source: RFC 2152 Alias: None

Name: UTF-16BE [RFC2781] MIBenum: 1013 Source: RFC 2781 Alias: None

Name: UTF-16LE [RFC2781] MIBenum: 1014 Source: RFC 2781 Alias: None

Name: UTF-16 [RFC2781] MIBenum: 1015 Source: RFC 2781 Alias: None

Name: CESU-8 [Phipps] MIBenum: 1016 Source: <http://www.unicode.org/unicode/reports/tr26> Alias: csCESU-8

Name: UTF-32 [Davis] MIBenum: 1017 Source: <http://www.unicode.org/unicode/reports/tr19/> Alias: None

Name: UTF-32BE [Davis] MIBenum: 1018 Source: <http://www.unicode.org/unicode/reports/tr19/> Alias: None

Name: UTF-32LE [Davis] MIBenum: 1019 Source: <http://www.unicode.org/unicode/reports/tr19/> Alias: None

Name: BOCU-1 [Scherer] MIBenum: 1020 Source: http://www.unicode.org/notes/tn6/ Alias: csBOCU-1

Name: ISO-8859-1-Windows-3.0-Latin-1 [HP-PCL5] MIBenum: 2000 Source: Extended ISO 8859-1 Latin-1 for Windows 3.0. PCL Symbol Set id: 9U Alias: csWindows30Latin1

Name: ISO-8859-1-Windows-3.1-Latin-1 [HP-PCL5] MIBenum: 2001 Source: Extended ISO 8859-1 Latin-1 for Windows 3.1. PCL Symbol Set id: 19U Alias: csWindows31Latin1

Name: ISO-8859-2-Windows-Latin-2 [HP-PCL5] MIBenum: 2002 Source: Extended ISO 8859-2. Latin-2 for Windows 3.1. PCL Symbol Set id: 9E Alias: csWindows31Latin2

Name: ISO-8859-9-Windows-Latin-5 [HP-PCL5] MIBenum: 2003 Source: Extended ISO 8859-9. Latin-5 for Windows 3.1 PCL Symbol Set id: 5T Alias: csWindows31Latin5

Name: hp-roman8 [HP-PCL5,RFC1345,KXS2] MIBenum: 2004 Source: LaserJet IIP Printer User's Manual, HP part no 33471-90901, Hewlett-Packard, June 1989. Alias: roman8 Alias: r8 Alias: csHPRoman8

Name: Adobe-Standard-Encoding [Adobe] MIBenum: 2005 Source: PostScript Language Reference Manual PCL Symbol Set id: 10J Alias: csAdobeStandardEncoding

Name: Ventura-US [HP-PCL5] MIBenum: 2006 Source: Ventura US. ASCII plus characters typically used in publishing, like pilcrow, copyright, registered, trade mark, section, dagger, and double dagger in the range A0 (hex) to FF (hex). PCL Symbol Set id: 14J Alias: csVenturaUS

Name: Ventura-International [HP-PCL5] MIBenum: 2007 Source: Ventura International. ASCII plus coded characters similar to Roman8. PCL Symbol Set id: 13J Alias: csVenturaInternational

Name: DEC-MCS [RFC1345,KXS2] MIBenum: 2008 Source: VAX/VMS User's Manual, Order Number: AI-Y517A-TE, April 1986. Alias: dec Alias: csDECMCS

Name: IBM850 [RFC1345,KXS2] MIBenum: 2009 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp850 Alias: 850 Alias: csPC850Multilingual

Name: PC8-Danish-Norwegian [HP-PCL5] MIBenum: 2012 Source: PC Danish Norwegian 8-bit PC set for Danish Norwegian PCL Symbol Set id: 11U Alias: csPC8DanishNorwegian

Name: IBM862 [RFC1345,KXS2] MIBenum: 2013 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp862 Alias: 862 Alias: csPC862LatinHebrew

Name: PC8-Turkish [HP-PCL5] MIBenum: 2014 Source: PC Latin Turkish. PCL Symbol Set id: 9T Alias: csPC8Turkish

Name: IBM-Symbols [IBM-CIDT] MIBenum: 2015 Source: Presentation Set, CPGID: 259 Alias: csIBMSymbols

Name: IBM-Thai [IBM-CIDT] MIBenum: 2016 Source: Presentation Set, CPGID: 838 Alias: csIBMThai

Name: HP-Legal [HP-PCL5] MIBenum: 2017 Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 1U Alias: csHPLegal

Name: HP-Pi-font [HP-PCL5] MIBenum: 2018 Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 15U Alias: csHPPiFont

Name: HP-Math8 [HP-PCL5] MIBenum: 2019 Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 8M Alias: csHPMath8

Name: Adobe-Symbol-Encoding [Adobe] MIBenum: 2020 Source: PostScript Language Reference Manual PCL Symbol Set id: 5M Alias: csHPPSMath

Name: HP-DeskTop [HP-PCL5] MIBenum: 2021 Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 7J Alias: csHPDesktop

Name: Ventura-Math [HP-PCL5] MIBenum: 2022 Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 6M Alias: csVenturaMath

Name: Microsoft-Publishing [HP-PCL5] MIBenum: 2023 Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 6J Alias: csMicrosoftPublishing

Name: Windows-31J MIBenum: 2024 Source: Windows Japanese. A further extension of Shift_JIS to include NEC special characters (Row 13), NEC selection of IBM extensions (Rows 89 to 92), and IBM extensions (Rows 115 to 119). The CCS's are JIS X0201:1997, JIS X0208:1997, and these extensions. This charset can be used for the top-level media type ``text'', but it is of limited or specialized use (see RFC2278). PCL Symbol Set id: 19K Alias: csWindows31J

Name: GB2312 (preferred MIME name) MIBenum: 2025 Source: Chinese for People's Republic of China (PRC) mixed one byte, two byte set: 20-7E = one byte ASCII A1-FE = two byte PRC Kanji See GB 2312-80 PCL Symbol Set Id: 18C Alias: csGB2312

Name: Big5 (preferred MIME name) MIBenum: 2026 Source: Chinese for Taiwan Multi-byte set. PCL Symbol Set Id: 18T Alias: csBig5

Name: macintosh [RFC1345,KXS2] MIBenum: 2027 Source: The Unicode Standard ver1.0, ISBN 0-201-56788-1, Oct 1991 Alias: mac Alias: csMacintosh

Name: IBM037 [RFC1345,KXS2] MIBenum: 2028 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp037 Alias: ebcdic-cp-us Alias: ebcdic-cp-ca Alias: ebcdic-cp-wt Alias: ebcdic-cp-nl Alias: csIBM037

Name: IBM038 [RFC1345,KXS2] MIBenum: 2029 Source: IBM 3174 Character Set Ref, GA27-3831-02, March 1990 Alias: EBCDIC-INT Alias: cp038 Alias: csIBM038

Name: IBM273 [RFC1345,KXS2] MIBenum: 2030 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP273 Alias: csIBM273

Name: IBM274 [RFC1345,KXS2] MIBenum: 2031 Source: IBM 3174 Character Set Ref, GA27-3831-02, March 1990 Alias: EBCDIC-BE Alias: CP274 Alias: csIBM274

Name: IBM275 [RFC1345,KXS2] MIBenum: 2032 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: EBCDIC-BR Alias: cp275 Alias: csIBM275

Name: IBM277 [RFC1345,KXS2] MIBenum: 2033 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: EBCDIC-CP-DK Alias: EBCDIC-CP-NO Alias: csIBM277

Name: IBM278 [RFC1345,KXS2] MIBenum: 2034 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP278 Alias: ebcdic-cp-fi Alias: ebcdic-cp-se Alias: csIBM278

Name: IBM280 [RFC1345,KXS2] MIBenum: 2035 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP280 Alias: ebcdic-cp-it Alias: csIBM280

Name: IBM281 [RFC1345,KXS2] MIBenum: 2036 Source: IBM 3174 Character Set Ref, GA27-3831-02, March 1990 Alias: EBCDIC-JP-E Alias: cp281 Alias: csIBM281

Name: IBM284 [RFC1345,KXS2] MIBenum: 2037 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP284 Alias: ebcdic-cp-es Alias: csIBM284

Name: IBM285 [RFC1345,KXS2] MIBenum: 2038 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP285 Alias: ebcdic-cp-gb Alias: csIBM285

Name: IBM290 [RFC1345,KXS2] MIBenum: 2039 Source: IBM 3174 Character Set Ref, GA27-3831-02, March 1990 Alias: cp290 Alias: EBCDIC-JP-kana Alias: csIBM290

Name: IBM297 [RFC1345,KXS2] MIBenum: 2040 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp297 Alias: ebcdic-cp-fr Alias: csIBM297

Name: IBM420 [RFC1345,KXS2] MIBenum: 2041 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990, IBM NLS RM p 11-11 Alias: cp420 Alias: ebcdic-cp-ar1 Alias: csIBM420

Name: IBM423 [RFC1345,KXS2] MIBenum: 2042 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp423 Alias: ebcdic-cp-gr Alias: csIBM423

Name: IBM424 [RFC1345,KXS2] MIBenum: 2043 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp424 Alias: ebcdic-cp-he Alias: csIBM424

Name: IBM437 [RFC1345,KXS2] MIBenum: 2011 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp437 Alias: 437 Alias: csPC8CodePage437

Name: IBM500 [RFC1345,KXS2] MIBenum: 2044 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP500 Alias: ebcdic-cp-be Alias: ebcdic-cp-ch Alias: csIBM500

Name: IBM851 [RFC1345,KXS2] MIBenum: 2045 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp851 Alias: 851 Alias: csIBM851

Name: IBM852 [RFC1345,KXS2] MIBenum: 2010 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp852 Alias: 852 Alias: csPCp852

Name: IBM855 [RFC1345,KXS2] MIBenum: 2046 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp855 Alias: 855 Alias: csIBM855

Name: IBM857 [RFC1345,KXS2] MIBenum: 2047 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp857 Alias: 857 Alias: csIBM857

Name: IBM860 [RFC1345,KXS2] MIBenum: 2048 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp860 Alias: 860 Alias: csIBM860

Name: IBM861 [RFC1345,KXS2] MIBenum: 2049 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp861 Alias: 861 Alias: cp-is Alias: csIBM861

Name: IBM863 [RFC1345,KXS2] MIBenum: 2050 Source: IBM Keyboard layouts and code pages, PN 07G4586 June 1991 Alias: cp863 Alias: 863 Alias: csIBM863

Name: IBM864 [RFC1345,KXS2] MIBenum: 2051 Source: IBM Keyboard layouts and code pages, PN 07G4586 June 1991 Alias: cp864 Alias: csIBM864

Name: IBM865 [RFC1345,KXS2] MIBenum: 2052 Source: IBM DOS 3.3 Ref (Abridged), 94X9575 (Feb 1987) Alias: cp865 Alias: 865 Alias: csIBM865

Name: IBM868 [RFC1345,KXS2] MIBenum: 2053 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP868 Alias: cp-ar Alias: csIBM868

Name: IBM869 [RFC1345,KXS2] MIBenum: 2054 Source: IBM Keyboard layouts and code pages, PN 07G4586 June 1991 Alias: cp869 Alias: 869 Alias: cp-gr Alias: csIBM869

Name: IBM870 [RFC1345,KXS2] MIBenum: 2055 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP870 Alias: ebcdic-cp-roece Alias: ebcdic-cp-yu Alias: csIBM870

Name: IBM871 [RFC1345,KXS2] MIBenum: 2056 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP871 Alias: ebcdic-cp-is Alias: csIBM871

Name: IBM880 [RFC1345,KXS2] MIBenum: 2057 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp880 Alias: EBCDIC-Cyrillic Alias: csIBM880

Name: IBM891 [RFC1345,KXS2] MIBenum: 2058 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp891 Alias: csIBM891

Name: IBM903 [RFC1345,KXS2] MIBenum: 2059 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp903 Alias: csIBM903

Name: IBM904 [RFC1345,KXS2] MIBenum: 2060 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: cp904 Alias: 904 Alias: csIBBM904

Name: IBM905 [RFC1345,KXS2] MIBenum: 2061 Source: IBM 3174 Character Set Ref, GA27-3831-02, March 1990 Alias: CP905 Alias: ebcdic-cp-tr Alias: csIBM905

Name: IBM918 [RFC1345,KXS2] MIBenum: 2062 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP918 Alias: ebcdic-cp-ar2 Alias: csIBM918

Name: IBM1026 [RFC1345,KXS2] MIBenum: 2063 Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 Alias: CP1026 Alias: csIBM1026

Name: EBCDIC-AT-DE [RFC1345,KXS2] MIBenum: 2064 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csIBMEBCDICATDE

Name: EBCDIC-AT-DE-A [RFC1345,KXS2] MIBenum: 2065 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICATDEA

Name: EBCDIC-CA-FR [RFC1345,KXS2] MIBenum: 2066 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICCAFR

Name: EBCDIC-DK-NO [RFC1345,KXS2] MIBenum: 2067 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICDKNO

Name: EBCDIC-DK-NO-A [RFC1345,KXS2] MIBenum: 2068 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICDKNOA

Name: EBCDIC-FI-SE [RFC1345,KXS2] MIBenum: 2069 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICFISE

Name: EBCDIC-FI-SE-A [RFC1345,KXS2] MIBenum: 2070 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICFISEA

Name: EBCDIC-FR [RFC1345,KXS2] MIBenum: 2071 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICFR

Name: EBCDIC-IT [RFC1345,KXS2] MIBenum: 2072 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICIT

Name: EBCDIC-PT [RFC1345,KXS2] MIBenum: 2073 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICPT

Name: EBCDIC-ES [RFC1345,KXS2] MIBenum: 2074 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICES

Name: EBCDIC-ES-A [RFC1345,KXS2] MIBenum: 2075 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICESA

Name: EBCDIC-ES-S [RFC1345,KXS2] MIBenum: 2076 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICESS

Name: EBCDIC-UK [RFC1345,KXS2] MIBenum: 2077 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICUK

Name: EBCDIC-US [RFC1345,KXS2] MIBenum: 2078 Source: IBM 3270 Char Set Ref Ch 10, GA27-2837-9, April 1987 Alias: csEBCDICUS

Name: UNKNOWN-8BIT [RFC1428] MIBenum: 2079 Alias: csUnknown8BiT

Name: MNEMONIC [RFC1345,KXS2] MIBenum: 2080 Source: RFC 1345, also known as ``mnemonic+ascii+38'' Alias: csMnemonic

Name: MNEM [RFC1345,KXS2] MIBenum: 2081 Source: RFC 1345, also known as ``mnemonic+ascii+8200'' Alias: csMnem

Name: VISCII [RFC1456] MIBenum: 2082 Source: RFC 1456 Alias: csVISCII

Name: VIQR [RFC1456] MIBenum: 2083 Source: RFC 1456 Alias: csVIQR

Name: KOI8-R (preferred MIME name) [RFC1489] MIBenum: 2084 Source: RFC 1489, based on GOST-19768-74, ISO-6937/8, INIS-Cyrillic, ISO-5427. Alias: csKOI8R

Name: HZ-GB-2312 MIBenum: 2085 Source: RFC 1842, RFC 1843 [RFC1842, RFC1843]

Name: IBM866 [Pond] MIBenum: 2086 Source: IBM NLDG Volume 2 (SE09-8002-03) August 1994 Alias: cp866 Alias: 866 Alias: csIBM866

Name: IBM775 [HP-PCL5] MIBenum: 2087 Source: HP PCL 5 Comparison Guide (P/N 5021-0329) pp B-13, 1996 Alias: cp775 Alias: csPC775Baltic

Name: KOI8-U [RFC2319] MIBenum: 2088 Source: RFC 2319

Name: IBM00858 MIBenum: 2089 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM00858) [Mahdi] Alias: CCSID00858 Alias: CP00858 Alias: PC-Multilingual-850+euro

Name: IBM00924 MIBenum: 2090 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM00924) [Mahdi] Alias: CCSID00924 Alias: CP00924 Alias: ebcdic-Latin9--euro

Name: IBM01140 MIBenum: 2091 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01140) [Mahdi] Alias: CCSID01140 Alias: CP01140 Alias: ebcdic-us-37+euro

Name: IBM01141 MIBenum: 2092 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01141) [Mahdi] Alias: CCSID01141 Alias: CP01141 Alias: ebcdic-de-273+euro

Name: IBM01142 MIBenum: 2093 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01142) [Mahdi] Alias: CCSID01142 Alias: CP01142 Alias: ebcdic-dk-277+euro Alias: ebcdic-no-277+euro

Name: IBM01143 MIBenum: 2094 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01143) [Mahdi] Alias: CCSID01143 Alias: CP01143 Alias: ebcdic-fi-278+euro Alias: ebcdic-se-278+euro

Name: IBM01144 MIBenum: 2095 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01144) [Mahdi] Alias: CCSID01144 Alias: CP01144 Alias: ebcdic-it-280+euro

Name: IBM01145 MIBenum: 2096 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01145) [Mahdi] Alias: CCSID01145 Alias: CP01145 Alias: ebcdic-es-284+euro

Name: IBM01146 MIBenum: 2097 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01146) [Mahdi] Alias: CCSID01146 Alias: CP01146 Alias: ebcdic-gb-285+euro

Name: IBM01147 MIBenum: 2098 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01147) [Mahdi] Alias: CCSID01147 Alias: CP01147 Alias: ebcdic-fr-297+euro

Name: IBM01148 MIBenum: 2099 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01148) [Mahdi] Alias: CCSID01148 Alias: CP01148 Alias: ebcdic-international-500+euro

Name: IBM01149 MIBenum: 2100 Source: IBM See (http://www.iana.org/assignments/charset-reg/IBM01149) [Mahdi] Alias: CCSID01149 Alias: CP01149 Alias: ebcdic-is-871+euro

Name: Big5-HKSCS [Yick] MIBenum: 2101 Source: See (http://www.iana.org/assignments/charset-reg/Big5-HKSCS) Alias: None

Name: IBM1047 [Robrigado] MIBenum: 2102 Source: IBM1047 (EBCDIC Latin 1/Open Systems) http://www-1.ibm.com/servers/eserver/iseries/software/globalization/pdf/cp01047z.pdf Alias: IBM-1047

Name: PTCP154 [Uskov] MIBenum: 2103 Source: See (http://www.iana.org/assignments/charset-reg/PTCP154) Alias: csPTCP154 Alias: PT154 Alias: CP154 Alias: Cyrillic-Asian

Name: Amiga-1251 MIBenum: 2104 Source: See (http://www.amiga.ultranet.ru/Amiga-1251.html) Alias: Ami1251 Alias: Amiga1251 Alias: Ami-1251 (Aliases are provided for historical reasons and should not be used) [Malyshev]




Name: KOI7-switched MIBenum: 2105 Source: See <http://www.iana.org/assignments/charset-reg/KOI7-switched> Alias: None

Name: BRF MIBenum: 2106 Source: See <http://www.iana.org/assignments/charset-reg/BRF> [Thibault] Alias: csBRF

Name: TSCII MIBenum: 2107 Source: See <http://www.iana.org/assignments/charset-reg/TSCII> [Kalyanasundaram] Alias: csTSCII

Name: windows-1250 MIBenum: 2250 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1250) [Lazhintseva] Alias: None

Name: windows-1251 MIBenum: 2251 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1251) [Lazhintseva] Alias: None

Name: windows-1252 MIBenum: 2252 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1252) [Wendt] Alias: None

Name: windows-1253 MIBenum: 2253 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1253) [Lazhintseva] Alias: None

Name: windows-1254 MIBenum: 2254 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1254) [Lazhintseva] Alias: None

Name: windows-1255 MIBenum: 2255 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1255) [Lazhintseva] Alias: None

Name: windows-1256 MIBenum: 2256 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1256) [Lazhintseva] Alias: None

Name: windows-1257 MIBenum: 2257 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1257) [Lazhintseva] Alias: None

Name: windows-1258 MIBenum: 2258 Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1258) [Lazhintseva] Alias: None

Name: TIS-620 MIBenum: 2259 Source: Thai Industrial Standards Institute (TISI) [Tantsetthi]
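
The records above share one flattened layout: a `Name:` field, an optional bracketed reference, a `MIBenum:` number, a free-text `Source:`, and zero or more `Alias:` fields. As a minimal sketch, assuming that layout holds for the line being read, one such record can be split into its parts with core Perl alone. The helper name `parse_record` is hypothetical and is not a function provided by I18N::Charset.

```perl
use strict;
use warnings;

# Parse one flattened registry record of the form
#   Name: X [refs] MIBenum: N Source: ... Alias: a Alias: b
# into a hash reference. parse_record is a hypothetical helper,
# not part of the I18N::Charset API.
sub parse_record {
    my ($line) = @_;
    my %rec = (aliases => []);

    # The name is the first whitespace-free token after "Name:".
    return unless $line =~ /^Name:\s+(\S+)/;
    $rec{name} = $1;

    # MIBenum is a decimal integer.
    $rec{mib} = $1 if $line =~ /\bMIBenum:\s+(\d+)/;

    # Collect every "Alias: value". The registry writes "None" when
    # a charset has no aliases, so that placeholder is skipped.
    while ($line =~ /\bAlias:\s+(\S+)/g) {
        push @{ $rec{aliases} }, $1 unless $1 eq 'None';
    }
    return \%rec;
}

my $rec = parse_record(
    'Name: IBM850 [RFC1345,KXS2] MIBenum: 2009 '
  . 'Source: IBM NLS RM Vol2 SE09-8002-01, March 1990 '
  . 'Alias: cp850 Alias: 850 Alias: csPC850Multilingual'
);

printf "%s (MIBenum %d): %s\n",
    $rec->{name}, $rec->{mib}, join(', ', @{ $rec->{aliases} });
```

For the IBM850 record this prints the name, the MIBenum 2009, and the three aliases in registry order. Records without an `Alias:` field, such as KOI8-U, simply yield an empty alias list.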




REFERENCES

----------

[RFC1345] Simonsen, K., ``Character Mnemonics & Character Sets'', RFC 1345, Rationel Almen Planlaegning, June 1992.

[RFC1428] Vaudreuil, G., ``Transition of Internet Mail from Just-Send-8 to 8bit-SMTP/MIME'', RFC1428, CNRI, February 1993.

[RFC1456] Vietnamese Standardization Working Group, ``Conventions for Encoding the Vietnamese Language VISCII: VIetnamese Standard Code for Information Interchange VIQR: VIetnamese Quoted-Readable Specification Revision 1.1'', RFC 1456, May 1993.

[RFC1468] Murai, J., Crispin, M., and E. van der Poel, ``Japanese Character Encoding for Internet Messages'', RFC 1468, Keio University, Panda Programming, June 1993.

[RFC1489] Chernov, A., ``Registration of a Cyrillic Character Set'', RFC1489, RELCOM Development Team, July 1993.

[RFC1554] Ohta, M., and K. Handa, ``ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP'', RFC1554, Tokyo Institute of Technology, ETL, December 1993.

[RFC1556] Nussbacher, H., ``Handling of Bi-directional Texts in MIME'', RFC1556, Israeli Inter-University, December 1993.

[RFC1557] Choi, U., Chon, K., and H. Park, ``Korean Character Encoding for Internet Messages'', RFC 1557, KAIST, Solvit Chosun Media, December 1993.

[RFC1641] Goldsmith, D., and M. Davis, ``Using Unicode with MIME'', RFC1641, Taligent, Inc., July 1994.

[RFC1642] Goldsmith, D., and M. Davis, ``UTF-7'', RFC1642, Taligent, Inc., July 1994.

[RFC1815] Ohta, M., ``Character Sets ISO-10646 and ISO-10646-J-1'', RFC 1815, Tokyo Institute of Technology, July 1995.




[Adobe] Adobe Systems Incorporated, PostScript Language Reference Manual, second edition, Addison-Wesley Publishing Company, Inc., 1990.

[ECMA Registry] ISO-IR: International Register of Escape Sequences http://www.itscj.ipsj.or.jp/ISO-IE/ Note: The current registration authority is IPSJ/ITSCJ, Japan.

[HP-PCL5] Hewlett-Packard Company, ``HP PCL 5 Comparison Guide'', (P/N 5021-0329) pp B-13, 1996.

[IBM-CIDT] IBM Corporation, ``ABOUT TYPE: IBM's Technical Reference for Core Interchange Digitized Type'', Publication number S544-3708-01

[RFC1842] Wei, Y., J. Li, and Y. Jiang, ``ASCII Printable Characters-Based Chinese Character Encoding for Internet Messages'', RFC 1842, Harvard University, Rice University, University of Maryland, August 1995.

[RFC1843] Lee, F., ``HZ - A Data Format for Exchanging Files of Arbitrarily Mixed Chinese and ASCII Characters'', RFC 1843, Stanford University, August 1995.

[RFC2152] Goldsmith, D., M. Davis, ``UTF-7: A Mail-Safe Transformation Format of Unicode'', RFC 2152, Apple Computer, Inc., Taligent Inc., May 1997.

[RFC2279] Yergeau, F., ``UTF-8, A Transformation Format of ISO 10646'', RFC 2279, Alis Technologies, January, 1998.

[RFC2781] Hoffman, P., Yergeau, F., ``UTF-16, an encoding of ISO 10646'', RFC 2781, February 2000.

[RFC3629] Yergeau, F., ``UTF-8, a transformation format of ISO 10646'', RFC3629, November 2003.

PEOPLE

------

[KXS2] Keld Simonsen <Keld.Simonsen&dkuug.dk>

[Choi] Woohyong Choi <whchoi&cosmos.kaist.ac.kr>

[Davis] Mark Davis, <mark&unicode.org>, April 2002.

[Kalyanasundaram] Kuppuswamy Kalyanasundaram, <kalyan.geo@yahoo.com>, 14 May 2007.

[Kikkarin] Sairan M. Kikkarin, <sairan&sci.kz>, 7 December 2006.

[Lazhintseva] Katya Lazhintseva, <katyal&MICROSOFT.com>, May 1996.

[Mahdi] Tamer Mahdi, <tamer&ca.ibm.com>, August 2000.

[Malyshev] Michael Malyshev, <michael_malyshev&mail.ru>, January 2004

[Murai] Jun Murai <jun&wide.ad.jp>

[Nussbacher] Hank Nussbacher, <hank&vm.tau.ac.il>

[Ohta] Masataka Ohta, <mohta&cc.titech.ac.jp>, July 1995.

[Phipps] Toby Phipps, <tphipps&peoplesoft.com>, March 2002.

[Pond] Rick Pond, <rickpond&vnet.ibm.com>, March 1997.

[Robrigado] Reuel Robrigado, <reuelr&ca.ibm.com>, September 2002.

[Scherer] Markus Scherer, <markus.scherer&jtcsv.com>, August 2000, September 2002.

[Simonsen] Keld Simonsen, <Keld.Simonsen&rap.dk>, August 2000.

[Tantsetthi] Trin Tantsetthi, <trin&mozart.inet.co.th>, September 1998.

[Thibault] Samuel Thibault, <samuel.thibault&ens-lyon.org>, 7 December 2006.

[Tumasonis] Vladas Tumasonis, <vladas.tumasonis&maf.vu.lt>, August 2000.

[Uskov] Alexander Uskov, <auskov&idc.kz>, September 2002.

[Veremeev] Alexei Veremeev, <Alexey.Veremeev&oracle.com>, 7 December 2006.

[Wendt] Chris Wendt, <christw&microsoft.com>, December 1999.

[Yick] Nicky Yick, <cliac&itsd.gcn.gov.hk>, October 2000.


EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE } # _init_data

1;

__END__
