pemistahl / lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,294Updated last week
Alternatives and similar repositories for lingua-py:
Users that are interested in lingua-py are comparing it to the libraries listed below
- Port of Google's language-detection library to Python.☆1,772Updated 3 weeks ago
- Spelling corrector in python☆478Updated 3 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆821Updated last week
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆897Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆305Updated this week
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,256Updated 3 weeks ago
- 📚 Process PDFs, Word documents and more with spaCy☆500Updated 3 weeks ago
- NeuSpell: A Neural Spelling Correction Toolkit☆691Updated last year
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆840Updated 7 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆154Updated 4 months ago
- 🦙 Integrating LLMs into structured NLP pipelines☆1,218Updated 2 months ago
- Fuzzy String Matching in Python☆3,130Updated 3 weeks ago
- 80x faster and 95% accurate language identification with Fasttext☆151Updated last year
- 🧹 Python package for text cleaning☆975Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆151Updated last year
- Rapid fuzzy string matching in Python using various string metrics☆2,987Updated 2 weeks ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆412Updated last month
- Fuzzy string matching, grouping, and evaluation.☆757Updated last month
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆214Updated 2 months ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆1,885Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,078Updated last week
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).☆1,237Updated last month
- a free python grammar checker 📝✅☆457Updated 3 weeks ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆736Updated 3 weeks ago
- A Python library for calculating a large variety of metrics from text☆332Updated 3 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆4,069Updated 2 weeks ago
- Truly universal encoding detector in pure Python☆632Updated 3 weeks ago
- Python bindings to PDFium☆552Updated last week
- ASCII transliterations of Unicode text - GitHub mirror☆555Updated 11 months ago
- Open neural machine translation models and web services☆671Updated 3 months ago