LBeaudoux / iso639Links
A fast, comprehensive, ISO 639 library.
☆47Updated 6 months ago
Alternatives and similar repositories for iso639
Users that are interested in iso639 are comparing it to the libraries listed below
Sorting:
- A Python library for working with and comparing language codes.☆353Updated 9 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆185Updated 8 months ago
- Python binding to Poppler-cpp pdf library☆113Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- python library to simplify working with jsonlines and ndjson data☆306Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆86Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆123Updated 3 months ago
- ISO 639 language codes☆52Updated last week
- Parse numbers written in natural language☆126Updated last year
- 80x faster and 95% accurate language identification with Fasttext☆164Updated 2 years ago
- A Python implementation of Lunr.js 🌖☆204Updated 11 months ago
- Python API for PDF documents☆124Updated last year
- ASCII transliterations of Unicode text - GitHub mirror☆597Updated last month
- Find parts of long text or data, allowing for some changes/typos.☆339Updated 3 months ago
- Bi-directional transliterator for Python. Transliterates (unicode) strings according to the rules specified in the language packs.☆310Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- A python based HTML to text conversion library, command line client and Web service.☆334Updated 2 months ago
- ISO 639 library for Python☆35Updated last year
- Simple streaming JSON parser and encoder.☆184Updated 3 months ago
- Python package for Google's diff-match-patch native C++ implementation.☆87Updated last year
- Show the differences between two strings/text as a compact text, in markdown/HTML, in the terminal and more.☆156Updated this week
- Python port of Boilerpipe library☆96Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆158Updated last month
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- A python package for grapheme aware string handling☆115Updated 3 years ago
- Truly universal encoding detector in pure Python.☆735Updated last week
- Pythonic search engine based on PyLucene.☆132Updated last month
- Next-generation Punkt sentence boundary detection with zero dependencies☆27Updated 2 months ago
- Targetted language identifier, based on FastText and Hunspell.☆38Updated 5 months ago
- Parallel and LAzY Analyzer for PDFs 🏖️☆38Updated last week