pemistahl / lingua-pyLinks
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,625Updated 2 months ago
Alternatives and similar repositories for lingua-py
Users that are interested in lingua-py are comparing it to the libraries listed below
Sorting:
- Spelling corrector in python☆491Updated 6 months ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆901Updated last year
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆1,232Updated last week
- 🧹 Python package for text cleaning☆1,000Updated this week
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆769Updated 2 months ago
- Port of Google's language-detection library to Python.☆1,870Updated 10 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆856Updated last week
- A Collection of BM25 Algorithms in Python☆1,299Updated last year
- a free python grammar checker 📝✅☆505Updated 2 weeks ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 11 months ago
- Efficient few-shot learning with Sentence Transformers☆2,673Updated last month
- Python wrapper for Wikipedia☆714Updated this week
- Fuzzy string matching, grouping, and evaluation.☆787Updated 6 months ago
- NeuSpell: A Neural Spelling Correction Toolkit☆704Updated 2 years ago
- Open neural machine translation models and web services☆767Updated 2 weeks ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆182Updated 7 months ago
- 80x faster and 95% accurate language identification with Fasttext☆164Updated 2 years ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,465Updated last month
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆283Updated 4 months ago
- 🦙 Integrating LLMs into structured NLP pipelines☆1,362Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆380Updated last week
- A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational e…☆912Updated 2 years ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages☆1,246Updated 2 years ago
- Single-document unsupervised keyword extraction☆1,817Updated last month
- Rapid fuzzy string matching in Python using various string metrics☆3,690Updated this week
- Minimal keyword extraction with BERT☆4,086Updated 3 months ago
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope…☆1,569Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆256Updated 3 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,237Updated 4 months ago