PyYoshi / uchardet
uchardet is an encoding detector library, which takes a sequence of bytes in an unknown character encoding and attempts to determine the encoding of the text. Returned encoding names are iconv-compatible.
☆42Updated 9 months ago
Alternatives and similar repositories for uchardet:
Users that are interested in uchardet are comparing it to the libraries listed below
- sample apps for ICU (formerly icuapps)☆22Updated 3 months ago
- jbig2 decoder using code from pdfium☆9Updated 6 years ago
- Low-level IO utilities for PosgtreSQL drivers.☆35Updated 3 months ago
- Bringing the power of python to stream editing☆50Updated 3 months ago
- A Python binding of SQLite Full Text Search Tokenizer☆47Updated last month
- Python Unicode Block Utilities☆24Updated 4 years ago
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆39Updated 14 years ago
- Python bindings for RocksDB☆34Updated 2 years ago
- Multithreading Library for Brotli, Lizard, LZ4, LZ5, Snappy and Zstandard☆196Updated last month
- Links recognition library with full unicode support☆18Updated last month
- Mirror of libidn2 repository☆13Updated 3 years ago
- String Matching Algorithms Research Tool☆99Updated 10 months ago
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆69Updated 2 years ago
- Tools reimplemented using Bela library☆32Updated last week
- Using io_uring Linux Kernel interface from Python by JITing C code with MetaCall.☆30Updated 3 years ago
- Python Interpreters Benchmarks☆21Updated last year
- ICU - International Components for Unicode☆12Updated 4 months ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/libgsf☆32Updated 3 months ago
- Mirror of git://git.code.sf.net/p/libwpd/librevenge☆10Updated 9 years ago
- Compares Python's text parsing libraries☆24Updated 3 years ago
- A small python based build file generator targetting the build system ninja☆46Updated 8 years ago
- compact_enc_det - Compact Encoding Detection☆223Updated last year
- Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and…☆58Updated 5 months ago
- A Python library for variable type checker/validator/converter at a run time.☆16Updated 2 months ago
- An SQLite extension library for creating histogram tables, tables of ratio between histograms and interpolation tables of scatter point …☆19Updated 4 years ago
- Seekable, gzip compatible, compression format☆15Updated last year
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆76Updated this week
- C++ client for rqlite☆12Updated 7 years ago
- Fastest general-purpose parsing library for Python with a familiar API☆44Updated last month
- Stand-alone Assertions for Python☆14Updated this week