Mimino666 / langdetect
Port of Google's language-detection library to Python.
☆1,729Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for langdetect
- Stand-alone language identification system☆2,324Updated 4 years ago
- Multilingual text (NLP) processing toolkit☆2,316Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,263Updated 3 years ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆951Updated 8 months ago
- spellchecking library for python☆601Updated 5 months ago
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,068Updated 3 weeks ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,150Updated 4 months ago
- Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.☆1,059Updated last year
- NLP, before and after spaCy☆2,217Updated last year
- A python implementation of the Rapid Automatic Keyword Extraction☆975Updated 4 years ago
- Heuristic based boilerplate removal tool☆729Updated 6 months ago
- ☆165Updated 5 months ago
- Convert HTML to Markdown-formatted text.☆1,847Updated 3 months ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,160Updated 3 weeks ago
- A tool for extracting plain text from Wikipedia dumps☆3,753Updated 5 months ago
- A simple Python module for parsing human names into their individual components☆658Updated 5 months ago
- A Python Implementation of Simhash Algorithm☆982Updated 2 years ago
- A library implementing different string similarity and distance measures using Python.☆992Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆801Updated this week
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words☆975Updated 2 weeks ago
- extract text from any document. no muss. no fuss.☆3,910Updated this week
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆808Updated 3 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,152Updated 5 months ago
- 🦆 Contextually-keyed word vectors☆1,625Updated 8 months ago
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆831Updated 3 months ago
- A python binding for crfsuite☆771Updated last month
- Python library for interactive topic model visualization. Port of the R LDAvis package.☆1,806Updated 4 months ago
- python parser for human readable dates☆2,560Updated last week
- Python Keyphrase Extraction module☆1,565Updated last year
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,667Updated last month