jacksonllee / iso639
ISO 639 language codes
☆39Updated 2 weeks ago
Alternatives and similar repositories for iso639:
Users that are interested in iso639 are comparing it to the libraries listed below
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated last month
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆69Updated last month
- Rust python bindings for symspell☆18Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆24Updated 3 months ago
- Cython wrapper on Hunspell Dictionary☆67Updated 8 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated 10 months ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 3 years ago
- Python Finite-State Toolkit☆52Updated last week
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 3 months ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 11 months ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆39Updated 2 years ago
- ☆30Updated 2 years ago
- Build and upload fastText Python wheels to PyPI☆23Updated last year
- A Python library for working with and comparing language codes.☆344Updated 2 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- A file utility for accessing both local and remote files through a unified interface.☆36Updated last month
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆78Updated 5 months ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆23Updated 6 months ago
- A python package to simulate typographical errors.☆32Updated last year
- A Python implementation of Lunr.js 🌖☆196Updated this week
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆105Updated 3 weeks ago
- Confection: the sweetest config system for Python☆183Updated 9 months ago
- List of corpora annotated for coreference for different languages☆17Updated 7 months ago
- Convert CoNLL output of a dependency parser into a latex or graphviz tree☆12Updated 4 years ago
- 💫 A spaCy package for Yohei Tamura's Rust tokenizations library☆27Updated last year
- A flexible sentence segmentation library using CRF model and regex rules☆29Updated last year