saffsd / langid.py
Stand-alone language identification system
☆2,297Updated 4 years ago
Related projects: ⓘ
- Port of Google's language-detection library to Python.☆1,709Updated 7 months ago
- Multilingual text (NLP) processing toolkit☆2,307Updated 10 months ago
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,092Updated this week
- Extract Keywords from sentence or Replace keywords in sentences.☆5,578Updated 2 months ago
- NLP, before and after spaCy☆2,206Updated 11 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,131Updated 2 months ago
- TextRank implementation for Python 3.☆1,246Updated last year
- A python implementation of the Rapid Automatic Keyword Extraction☆973Updated 4 years ago
- KenLM: Faster and Smaller Language Model Queries☆2,492Updated last month
- Module for automatic summarization of text documents and HTML pages.☆3,506Updated 4 months ago
- 🦆 Contextually-keyed word vectors☆1,617Updated 6 months ago
- Html Content / Article Extractor, web scrapping lib in Python☆3,974Updated 2 years ago
- ☆1,112Updated this week
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,129Updated 3 months ago
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆807Updated last month
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,040Updated 2 weeks ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆696Updated 6 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,261Updated 3 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,357Updated last week
- Python library for interactive topic model visualization. Port of the R LDAvis package.☆1,802Updated 2 months ago
- Reading Wikipedia to Answer Open-Domain Questions☆4,474Updated 11 months ago
- A tool for extracting plain text from Wikipedia dumps☆3,732Updated 3 months ago
- Just the facts -- web page content extraction☆1,244Updated 2 months ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/☆1,249Updated 2 years ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,835Updated last month
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆933Updated 5 months ago
- Python interface to Google word2vec☆2,568Updated last year
- Heuristic based boilerplate removal tool☆717Updated 4 months ago
- extract text from any document. no muss. no fuss.☆3,865Updated this week
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,179Updated 2 years ago