LBeaudoux / iso639
A fast, comprehensive ISO 639 library.
☆41 · Updated 3 weeks ago
Alternatives and similar repositories for iso639
Users who are interested in iso639 are comparing it to the libraries listed below.
- A Python implementation of Lunr.js ☆197 · Updated 4 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆166 · Updated last month
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆152 · Updated 2 years ago
- A Python library for working with and comparing language codes. ☆345 · Updated 2 months ago
- Simple streaming JSON parser and encoder. ☆165 · Updated 5 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆115 · Updated 4 months ago
- Python API for PDF documents ☆123 · Updated 10 months ago
- Parse numbers written in natural language ☆119 · Updated 8 months ago
- Python library to simplify working with jsonlines and ndjson data ☆295 · Updated 11 months ago
- Python binding to the Poppler-cpp PDF library ☆110 · Updated 10 months ago
- 80x faster and 95% accurate language identification with Fasttext ☆158 · Updated last year
- A Python-based HTML to text conversion library, command-line client and Web service. ☆312 · Updated last month
- Python interface to Apache PDFBox command-line tools. ☆75 · Updated 2 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆207 · Updated 3 weeks ago
- Python bindings for Tantivy ☆342 · Updated 2 weeks ago
- ASCII transliterations of Unicode text - GitHub mirror ☆571 · Updated 2 months ago
- ☆522 · Updated last month
- ISO 639 library for Python ☆33 · Updated 10 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆133 · Updated 6 months ago
- A Python package for grapheme-aware string handling ☆112 · Updated 3 years ago
- ☆170 · Updated 3 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆338 · Updated 3 months ago
- Library for unit extraction - fork of quantulum for python3 ☆141 · Updated last year
- Show the differences between two strings/text as a compact text, in markdown/HTML, in the terminal and more. ☆134 · Updated 3 weeks ago
- Find parts of long text or data, allowing for some changes/typos. ☆325 · Updated last month
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆73 · Updated 2 weeks ago
- Python port of Boilerpipe library ☆88 · Updated 10 months ago
- Targeted language identifier, based on FastText and Hunspell. ☆36 · Updated 5 months ago
- Complete lxml external type annotation ☆63 · Updated 2 weeks ago