pemistahl / lingua-pyLinks
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
β1,399Updated last week
Alternatives and similar repositories for lingua-py
Users that are interested in lingua-py are comparing it to the libraries listed below
Sorting:
- ππ―pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.β853Updated 10 months ago
- Spelling corrector in pythonβ484Updated 5 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ332Updated 2 months ago
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.β1,071Updated this week
- NeuSpell: A Neural Spelling Correction Toolkitβ695Updated last year
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/β747Updated last week
- 80x faster and 95% accurate language identification with Fasttextβ157Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.β248Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyβ163Updated 2 weeks ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β830Updated last month
- a free python grammar checker πββ470Updated 3 weeks ago
- Rapid fuzzy string matching in Python using various string metricsβ3,181Updated this week
- A Collection of BM25 Algorithms in Pythonβ1,190Updated 8 months ago
- β836Updated 2 years ago
- Open neural machine translation models and web servicesβ699Updated last week
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).β1,287Updated last week
- Training open neural machine translation modelsβ365Updated 3 months ago
- A Python library for calculating a large variety of metrics from textβ339Updated 6 months ago
- π¦ Integrating LLMs into structured NLP pipelinesβ1,267Updated 5 months ago
- π Process PDFs, Word documents and more with spaCyβ644Updated 3 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipyβ1,213Updated 3 weeks ago
- π§Ή Python package for text cleaningβ982Updated 2 years ago
- Port of Google's language-detection library to Python.β1,815Updated 3 months ago
- Tools to download and cleanup Common Crawl dataβ1,016Updated 2 years ago
- Minimal keyword extraction with BERTβ3,904Updated 2 months ago
- βοΈContextual word checker for better suggestions (not actively maintained)β414Updated 4 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.β1,296Updated last week
- Super Fast String Matching in Pythonβ369Updated 3 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β154Updated last year
- Article extraction benchmark: dataset and evaluation scriptsβ317Updated last year