pemistahl / lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
★1,523 · Updated this week
Alternatives and similar repositories for lingua-py
Users who are interested in lingua-py are comparing it to the libraries listed below.
- Port of Google's language-detection library to Python. ★1,851 · Updated 7 months ago
- pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection module that works out of the box. ★878 · Updated last year
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way. ★1,172 · Updated 3 weeks ago
- 🧹 Python package for text cleaning ★997 · Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ★760 · Updated last month
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages ★1,243 · Updated last year
- Spelling corrector in Python ★487 · Updated 3 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ★847 · Updated last month
- NeuSpell: A Neural Spelling Correction Toolkit ★697 · Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ★358 · Updated 6 months ago
- Contextual word checker for better suggestions (not actively maintained) ★416 · Updated 8 months ago
- Efficient few-shot learning with Sentence Transformers ★2,587 · Updated 2 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ★177 · Updated 4 months ago
- A Collection of BM25 Algorithms in Python ★1,253 · Updated last year
- Integrating LLMs into structured NLP pipelines ★1,324 · Updated 9 months ago
- 80x faster and 95% accurate language identification with Fasttext ★161 · Updated last year
- Process PDFs, Word documents and more with spaCy ★784 · Updated 7 months ago
- Rapid fuzzy string matching in Python using various string metrics ★3,492 · Updated this week
- Python wrapper for Wikipedia ★702 · Updated this week
- Library for translating between 200 languages. Built on 🤗 transformers. ★494 · Updated last year
- Python package to calculate readability statistics of a text object: paragraphs, sentences, articles. ★1,322 · Updated last month
- Single-document unsupervised keyword extraction ★1,795 · Updated last month
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ★1,368 · Updated last month
- Training open neural machine translation models ★380 · Updated 7 months ago
- Open neural machine translation models and web services ★737 · Updated 4 months ago
- Fuzzy string matching, grouping, and evaluation. ★785 · Updated 3 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ★255 · Updated 2 years ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024 ★2,441 · Updated last week
- Benchmarking PDF libraries ★314 · Updated 3 months ago
- A free Python grammar checker ★495 · Updated last month