jacksonllee / iso639
ISO 639 language codes
☆46Updated 5 months ago
Alternatives and similar repositories for iso639
Users who are interested in iso639 are comparing it to the libraries listed below.
- A Python library for working with and comparing language codes.☆345Updated 3 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Pythonic search engine based on PyLucene.☆128Updated 8 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆107Updated 2 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆74Updated last month
- Confection: the sweetest config system for Python☆188Updated 4 months ago
- Accurately find/replace/remove emojis in text strings☆163Updated last year
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- ☆172Updated 4 months ago
- A Python implementation of Lunr.js 🌖☆198Updated 5 months ago
- Language detection using spaCy and fastText☆57Updated last year
- Python Finite-State Toolkit☆57Updated last week
- Faster, modernized fork of the language identification tool langid.py☆56Updated 8 months ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆205Updated last week
- Abydos NLP/IR library for Python☆188Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆168Updated 2 months ago
- A Python package to simulate typographical errors.☆36Updated last year
- ISO 639 library for Python☆34Updated 11 months ago
- Parse numbers written in natural language☆122Updated 9 months ago
- fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified (zh-hans) and Traditional Chinese (zh-ha…☆39Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated this week
- Hy-phen-ation made easy☆211Updated 5 months ago
- 80x faster and 95% accurate language identification with fastText☆160Updated last year
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆472Updated 6 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆100Updated last year
- A Python truecasing utility that restores case information for texts☆89Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆98Updated 2 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆51Updated last month
- A Python module for word inflections designed for use with spaCy.☆92Updated 5 years ago