Mimino666 / langdetect
Port of Google's language-detection library to Python.
☆1,841 · Updated 6 months ago
Alternatives and similar repositories for langdetect
Users who are interested in langdetect are comparing it to the libraries listed below.
- Stand-alone language identification system ☆2,421 · Updated 5 years ago
- Multilingual text (NLP) processing toolkit ☆2,352 · Updated last year
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,156 · Updated last week
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆755 · Updated last week
- Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK. ☆1,073 · Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆1,279 · Updated 4 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆843 · Updated last week
- python parser for human readable dates ☆2,717 · Updated 3 weeks ago
- spellchecking library for python ☆614 · Updated this week
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text ☆1,489 · Updated 3 months ago
- Heuristic based boilerplate removal tool ☆794 · Updated 6 months ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm ☆1,026 · Updated 3 months ago
- TextRank implementation for Python 3. ☆1,262 · Updated 2 years ago
- NLP, before and after spaCy ☆2,228 · Updated last year
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words ☆1,037 · Updated 4 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ☆2,196 · Updated 2 months ago
- A library implementing different string similarity and distance measures using Python. ☆1,017 · Updated 2 years ago
- extract text from any document. no muss. no fuss. ☆4,292 · Updated 9 months ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box. ☆873 · Updated last year
- ASCII transliterations of Unicode text - GitHub mirror ☆587 · Updated last week
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc. ☆632 · Updated 4 years ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles. ☆1,314 · Updated 2 weeks ago
- 🦆 Contextually-keyed word vectors ☆1,657 · Updated 4 months ago
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ☆1,616 · Updated 5 months ago
- Find dates inside text using Python and get back datetime objects ☆662 · Updated last year
- A python implementation of the Rapid Automatic Keyword Extraction ☆978 · Updated 5 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,491 · Updated 5 months ago
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html ☆884 · Updated this week
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆376 · Updated 2 years ago
- A collection of common regular expressions bundled with an easy to use interface. ☆1,578 · Updated 2 years ago