pemistahl / lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,104 Updated this week
Related projects:
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆696 Updated 6 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆791 Updated last week
- Port of Google's language-detection library to Python. ☆1,709 Updated 7 months ago
- 🧹 Python package for text cleaning ☆946 Updated last year
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way. ☆682 Updated last week
- Spelling corrector in python ☆449 Updated 9 months ago
- Rapid fuzzy string matching in Python using various string metrics ☆2,614 Updated this week
- Python bindings to PDFium ☆349 Updated this week
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box. ☆782 Updated 3 weeks ago
- Fuzzy string matching, grouping, and evaluation. ☆736 Updated 3 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,143 Updated this week
- NeuSpell: A Neural Spelling Correction Toolkit ☆662 Updated last year
- ✔️Contextual word checker for better suggestions ☆405 Updated 5 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM… ☆3,449 Updated last week
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆267 Updated last month
- a free python grammar checker 📝✅ ☆423 Updated 3 weeks ago
- Single-document unsupervised keyword extraction ☆1,626 Updated 8 months ago
- Fuzzy String Matching in Python ☆2,763 Updated 6 months ago
- 🦙 Integrating LLMs into structured NLP pipelines ☆1,073 Updated last month
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles. ☆1,129 Updated 3 months ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages ☆1,146 Updated 8 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆225 Updated last year
- 📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites. ☆429 Updated 3 months ago
- Open neural machine translation models and web services ☆598 Updated 2 months ago
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors). ☆1,096 Updated 2 weeks ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024 ☆1,256 Updated this week
- 1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems. ☆857 Updated this week
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words ☆961 Updated last week
- NLP, before and after spaCy ☆2,206 Updated 11 months ago
- A Python library to access ISO country, subdivision, language, currency and script definitions and their translations. ☆740 Updated last week