chardet on PyPI

Chardet: The Universal Character Encoding Detector

Detects

ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)
Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)
EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)
EUC-KR, ISO-2022-KR, Johab (Korean)
KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)
ISO-8859-5, windows-1251 (Bulgarian)
ISO-8859-1, windows-1252, MacRoman (Western European languages)
ISO-8859-7, windows-1253 (Greek)
ISO-8859-8, windows-1255 (Visual and Logical Hebrew)
TIS-620 (Thai)

Note

Our ISO-8859-2 and windows-1250 (Hungarian) probers have been temporarily disabled until we can retrain the models.

Requires Python 3.8+.

Installation

Install from PyPI:

pip install chardet

Documentation

For users, docs are now available at https://chardet.readthedocs.io/.

Command-line Tool

chardet comes with a command-line script which reports on the encodings of one or more files:

% chardetect somefile someotherfile
somefile: windows-1252 with confidence 0.5
someotherfile: ascii with confidence 1.0
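
The same kind of report can be produced from Python. The following is a minimal sketch, not the chardetect implementation: it assumes the library's `chardet.detect()` function, which takes bytes and returns a dict with `encoding` and `confidence` keys. The helper names `format_report` and `detect_file` are hypothetical, chosen for illustration only.

```python
import sys


def format_report(name, result):
    """Format a detection result like the chardetect lines shown above.

    `result` is assumed to be a dict with 'encoding' and 'confidence'
    keys, which is the shape chardet.detect() returns.
    """
    return f"{name}: {result['encoding']} with confidence {result['confidence']}"


def detect_file(path):
    """Read a file as raw bytes and let chardet guess its encoding."""
    # Imported here so the formatting helper works without the
    # third-party dependency; install it with `pip install chardet`.
    import chardet

    with open(path, "rb") as fh:
        return chardet.detect(fh.read())


if __name__ == "__main__":
    # Hypothetical usage: python report.py somefile someotherfile
    for path in sys.argv[1:]:
        print(format_report(path, detect_file(path)))
```

Reading the whole file into memory keeps the sketch short; for very large files a streaming approach may be preferable.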

About

This is a continuation of Mark Pilgrim's excellent original chardet port from C, and Ian Cordasco's charade Python 3-compatible fork.

maintainer: Dan Blanchard

Stats

Dependent packages: 3.49K
Dependent repositories: 2.13K
Latest release: Aug 1, 2023
First release: Dec 23, 2006
Repository size: 4.88 MB

chardet
Release 5.2.0

Releases: 5.2.0, 5.1.0, 5.0.0, 4.0.0, 3.0.4, 3.0.3, 3.0.2, 3.0.1, 3.0.0, 2.3.0
