pemistahl / lingua-pyLinks
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
β1,584Updated 3 weeks ago
Alternatives and similar repositories for lingua-py
Users that are interested in lingua-py are comparing it to the libraries listed below
Sorting:
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.β1,210Updated last week
- ππ―pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.β890Updated last year
- Port of Google's language-detection library to Python.β1,862Updated 9 months ago
- Spelling corrector in pythonβ492Updated 5 months ago
- π§Ή Python package for text cleaningβ998Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β851Updated 2 weeks ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/β767Updated 2 weeks ago
- a free python grammar checker πββ501Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ368Updated 3 weeks ago
- βοΈContextual word checker for better suggestions (not actively maintained)β418Updated 10 months ago
- π Process PDFs, Word documents and more with spaCyβ824Updated 9 months ago
- π¦ Integrating LLMs into structured NLP pipelinesβ1,355Updated 11 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyβ180Updated 6 months ago
- Fuzzy string matching, grouping, and evaluation.β786Updated 5 months ago
- Rapid fuzzy string matching in Python using various string metricsβ3,599Updated this week
- Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.β1,488Updated last week
- 80x faster and 95% accurate language identification with Fasttextβ163Updated last year
- A python based HTML to text conversion library, command line client and Web service.β331Updated last month
- Single-document unsupervised keyword extractionβ1,801Updated 2 weeks ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipyβ1,427Updated 2 weeks ago
- A Collection of BM25 Algorithms in Pythonβ1,275Updated last year
- Python wrapper for Wikipediaβ709Updated this week
- Heuristic based boilerplate removal toolβ809Updated 9 months ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languagesβ1,241Updated last year
- Library for translating between 200 languages. Built on π€ transformers.β495Updated last year
- Python bindings to PDFium, reasonably cross-platform.β692Updated this week
- NeuSpell: A Neural Spelling Correction Toolkitβ702Updated 2 years ago
- A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators.β1,909Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024β2,603Updated 3 weeks ago
- Fuzzy String Matching in Pythonβ3,518Updated 9 months ago