LBeaudoux / iso639
A fast, comprehensive, ISO 639 library.
☆38Updated 2 months ago
Alternatives and similar repositories for iso639
Users that are interested in iso639 are comparing it to the libraries listed below
Sorting:
- Python binding to Poppler-cpp pdf library☆110Updated 8 months ago
- A Python library for working with and comparing language codes.☆346Updated last week
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆114Updated 2 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆156Updated this week
- A Python implementation of Lunr.js 🌖☆195Updated 2 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆71Updated last year
- Language detection using Spacy and Fasttext☆55Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆72Updated last week
- ISO 639 language codes☆44Updated 2 months ago
- Pandoc (Python Library)☆156Updated 8 months ago
- Python bindings for Milli, the embeddable Rust-based search engine powering Meilisearch☆131Updated 11 months ago
- ISO 639 library for Python☆33Updated 8 months ago
- UUID version 7, which are time-sortable (following the Peabody RFC4122 draft)☆107Updated 3 months ago
- An OCR evaluation tool☆65Updated 2 weeks ago
- Hy-phen-ation made easy☆212Updated 2 months ago
- 80x faster and 95% accurate language identification with Fasttext☆153Updated last year
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated 2 years ago
- Targetted language identifier, based on FastText and Hunspell.☆34Updated 3 months ago
- Pure python implementation of identifying files based off their magic numbers☆192Updated last week
- Simple streaming JSON parser and encoder.☆159Updated 3 months ago
- A modern CSS selector implementation for BeautifulSoup☆234Updated last week
- Parse numbers written in natural language☆114Updated 6 months ago
- Python binding to Ammonia HTML sanitizer Rust crate☆287Updated 2 weeks ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆151Updated last year
- Python package for Google's diff-match-patch native C++ implementation.☆77Updated 11 months ago
- OCR-D python tools☆33Updated 8 months ago
- python xml for humans☆195Updated 3 weeks ago
- Allowlist-based HTML cleaner☆142Updated 4 months ago
- python library to simplify working with jsonlines and ndjson data☆292Updated 9 months ago