net.sf.jmatchparser.util.charset.juniversalchardet
Class JUniversalChardetCharsetProvider
java.lang.Object
java.nio.charset.spi.CharsetProvider
net.sf.jmatchparser.util.charset.juniversalchardet.JUniversalChardetCharsetProvider
public class JUniversalChardetCharsetProvider
- extends CharsetProvider
Charset provider that provides a convenience charset that uses jUniversalChardet to dynamically detect
the charset it uses for decoding.
The name of this charset is "jUniversalChardet".
This class is loaded automatically via SPI when it is in the class path.
Character sets that can be detected
- UTF-8
- Shift_JIS
- EUC-JP
- ISO-2022-JP
- EUC-KR
- ISO-2022-KR
- Big5
- x-euc-tw
- GB2312
- GB18030
- ISO-2022-CN
- HZ-GB-2312 (not supported by Java by default)
- ISO-8859-5
- KOI8-R
- windows-1251
- MacCyrillic
- IBM866
- IBM855
- ISO-8859-7
- windows-1253
- ISO-8859-8
- windows-1255
- windows-1252
- UTF-16BE
- UTF-16LE
- UTF-32BE
- UTF-32LE
- X-ISO-10646-UCS-4-3412 (not supported by Java by default)
- X-ISO-10646-UCS-4-2143 (not supported by Java by default)
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
JUniversalChardetCharsetProvider
public JUniversalChardetCharsetProvider()
charsetForName
public Charset charsetForName(String charsetName)
- Specified by:
charsetForName
in class CharsetProvider
charsets
public Iterator<Charset> charsets()
- Specified by:
charsets
in class CharsetProvider
Copyright © 2011. All Rights Reserved.