Class CharacterSet

  • All Implemented Interfaces:
    Serializable, Comparable<CharacterSet>, ControlledVocabulary

    @Deprecated
    @UML(identifier="MD_CharacterSetCode",
         specification=ISO_19115)
    public final class CharacterSet
    extends CodeList<CharacterSet>
    Deprecated.
    As of ISO 19115:2014, replaced by a reference to the IANA Character Set register. Represented in Java by Charset.
    Name of the character coding standard used for the resource. The UML identifiers were defined in ISO 19115:2003, but removed from ISO 19115:2014. Mapping from legacy UML identifiers to the IANA names is as below:
     # From ISO 19115:2003 to java.nio.charset
     ucs2       = UCS-2
     ucs4       = UCS-4
     utf7       = UTF-7
     utf8       = UTF-8
     utf16      = UTF-16
     8859part1  = ISO-8859-1
     8859part2  = ISO-8859-2
     8859part3  = ISO-8859-3
     8859part4  = ISO-8859-4
     8859part5  = ISO-8859-5
     8859part6  = ISO-8859-6
     8859part7  = ISO-8859-7
     8859part8  = ISO-8859-8
     8859part9  = ISO-8859-9
     8859part10 = ISO-8859-10
     8859part11 = ISO-8859-11
     8859part12 = ISO-8859-12
     8859part13 = ISO-8859-13
     8859part14 = ISO-8859-14
     8859part15 = ISO-8859-15
     8859part16 = ISO-8859-16
     jis        = JIS_X0201
     shiftJIS   = Shift_JIS
     eucJP      = EUC-JP
     usAscii    = US-ASCII
     ebcdic     = EBCDIC
     eucKR      = EUC-KR
     big5       = Big5
     GB2312     = GB2312
    This mapping is provided by GeoAPI in the org/opengis/metadata/2003/charset-codes.properties file. This file can be read as below:
     final Properties codes = new Properties();
     try (InputStream in = Metadata.class.getResourceAsStream("2003/charset-codes.properties")) {
         codes.load(in);
     }
    Since:
    2.1
    See Also:
    Serialized Form
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static CharacterSet BIG_5
      Deprecated.
      Traditional Chinese code set used in Taiwan, Hong Kong, and other areas.
      static CharacterSet EBCDIC
      Deprecated.
      IBM mainframe code set.
      static CharacterSet EUC_JP
      Deprecated.
      Japanese code set used on UNIX based machines.
      static CharacterSet EUC_KR
      Deprecated.
      Korean code set.
      static CharacterSet GB2312
      Deprecated.
      Simplified Chinese code set.
      static CharacterSet ISO_8859_1
      Deprecated.
      ISO/IEC 8859-1, Information technology. 8-bit single-byte coded graphic character sets - Part 1: Latin alphabet No. 1.
      static CharacterSet ISO_8859_10
      Deprecated.
      ISO/IEC 8859-10, Information technology. 8-bit single-byte coded graphic character sets - Part 10: Latin alphabet No. 6.
      static CharacterSet ISO_8859_11
      Deprecated.
      ISO/IEC 8859-11, Information technology. 8-bit single-byte coded graphic character sets - Part 11: Latin/Thai alphabet.
      static CharacterSet ISO_8859_12
      Deprecated.
      A future ISO/IEC 8-bit single-byte coded graphic character set.
      static CharacterSet ISO_8859_13
      Deprecated.
      ISO/IEC 8859-13, Information technology. 8-bit single-byte coded graphic character sets - Part 13: Latin alphabet No. 7.
      static CharacterSet ISO_8859_14
      Deprecated.
      ISO/IEC 8859-14, Information technology. 8-bit single-byte coded graphic character sets - Part 14: Latin alphabet No. 8 (Celtic).
      static CharacterSet ISO_8859_15
      Deprecated.
      ISO/IEC 8859-15, Information technology. 8-bit single-byte coded graphic character sets - Part 15: Latin alphabet No. 9.
      static CharacterSet ISO_8859_16
      Deprecated.
      ISO/IEC 8859-16, Information technology. 8-bit single-byte coded graphic character sets - Part 16: Latin alphabet No. 10.
      static CharacterSet ISO_8859_2
      Deprecated.
      ISO/IEC 8859-2, Information technology. 8-bit single-byte coded graphic character sets - Part 2: Latin alphabet No. 2.
      static CharacterSet ISO_8859_3
      Deprecated.
      ISO/IEC 8859-3, Information technology. 8-bit single-byte coded graphic character sets - Part 3: Latin alphabet No. 3.
      static CharacterSet ISO_8859_4
      Deprecated.
      ISO/IEC 8859-4, Information technology. 8-bit single-byte coded graphic character sets - Part 4: Latin alphabet No. 4.
      static CharacterSet ISO_8859_5
      Deprecated.
      ISO/IEC 8859-5, Information technology. 8-bit single-byte coded graphic character sets - Part 5: Latin/Cyrillic alphabet.
      static CharacterSet ISO_8859_6
      Deprecated.
      ISO/IEC 8859-6, Information technology. 8-bit single-byte coded graphic character sets - Part 6: Latin/Arabic alphabet.
      static CharacterSet ISO_8859_7
      Deprecated.
      ISO/IEC 8859-7, Information technology. 8-bit single-byte coded graphic character sets - Part 7: Latin/Greek alphabet.
      static CharacterSet ISO_8859_8
      Deprecated.
      ISO/IEC 8859-8, Information technology. 8-bit single-byte coded graphic character sets - Part 8: Latin/Hebrew alphabet.
      static CharacterSet ISO_8859_9
      Deprecated.
      ISO/IEC 8859-9, Information technology. 8-bit single-byte coded graphic character sets - Part 9: Latin alphabet No. 5.
      static CharacterSet JIS
      Deprecated.
      Japanese code set used for electronic transmission.
      static CharacterSet SHIFT_JIS
      Deprecated.
      Japanese code set used on MS-DOS based machines.
      static CharacterSet UCS_2
      Deprecated.
      16-bit fixed size Universal Character Set, based on ISO/IEC 10646.
      static CharacterSet UCS_4
      Deprecated.
      32-bit fixed size Universal Character Set, based on ISO/IEC 10646.
      static CharacterSet US_ASCII
      Deprecated.
      United States ASCII code set (ISO 646 US).
      static CharacterSet UTF_16
      Deprecated.
      16-bit variable size UCS Transfer Format, based on ISO/IEC 10646.
      static CharacterSet UTF_7
      Deprecated.
      7-bit variable size UCS Transfer Format, based on ISO/IEC 10646.
      static CharacterSet UTF_8
      Deprecated.
      8-bit variable size UCS Transfer Format, based on ISO/IEC 10646.