pemistahl / lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,376 Updated last week
Alternatives and similar repositories for lingua-py
Users who are interested in lingua-py are comparing it to the libraries listed below.
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆827 Updated last month
- Efficient few-shot learning with Sentence Transformers ☆2,486 Updated last month
- 80x faster and 95% accurate language identification with Fasttext ☆155 Updated last year
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆413 Updated 4 months ago
- 🧹 Python package for text cleaning ☆979 Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation. ☆763 Updated 3 weeks ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆1,175 Updated last week
- 🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box. ☆851 Updated 9 months ago
- Article extraction benchmark: dataset and evaluation scripts ☆315 Updated last year
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way. ☆1,051 Updated 2 months ago
- 📚 Process PDFs, Word documents and more with spaCy ☆615 Updated 2 months ago
- Heuristic based boilerplate removal tool ☆780 Updated 3 months ago
- 🦙 Integrating LLMs into structured NLP pipelines ☆1,254 Updated 4 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆325 Updated last month
- Spelling corrector in python ☆482 Updated 5 months ago
- Python bindings to PDFium ☆578 Updated this week
- Rapid fuzzy string matching in Python using various string metrics ☆3,129 Updated last week
- A python based HTML to text conversion library, command line client and Web service. ☆306 Updated 2 months ago
- Port of Google's language-detection library to Python. ☆1,804 Updated 3 months ago
- Fuzzy String Matching in Python ☆3,242 Updated 3 months ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing ☆754 Updated 7 months ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages ☆1,224 Updated last year
- A Collection of BM25 Algorithms in Python ☆1,173 Updated 7 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM… ☆4,309 Updated 2 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆160 Updated 2 weeks ago
- a free python grammar checker 📝✅ ☆466 Updated last month
- Single-document unsupervised keyword extraction ☆1,731 Updated this week
- A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational e… ☆896 Updated last year
- Minimal keyword extraction with BERT ☆3,870 Updated 2 months ago
- Python Keyphrase Extraction module ☆1,582 Updated last year