wardbradt / HTMLSTLinks
A library to extract sentences from HTML
☆11Updated 4 years ago
Alternatives and similar repositories for HTMLST
Users that are interested in HTMLST are comparing it to the libraries listed below
Sorting:
- ☆129Updated 4 years ago
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆64Updated 11 years ago
- A thin wrapper around the DBPedia Spotlight REST API☆60Updated last year
- An emotion classifier of text containing technical content from the SE domain☆76Updated 6 months ago
- Language Detection with Infinity-gram☆230Updated 10 years ago
- Train a neural network optimized for generating tweets based off of any number of Twitter users.☆222Updated 7 years ago
- A Deep NN used to generate stories which will tingle your butt.☆40Updated 4 years ago
- This program is a Python XML-RPC server that accepts an English word and returns a continuous value (from 0 to 1, inclusive) on how compl…☆19Updated 9 years ago
- Subjectivity and sentiment classification using polarity lexicons☆91Updated 4 years ago
- English grammar checker code☆43Updated 12 years ago
- a Deep Learning based Speller☆227Updated 7 years ago
- An introduction to using spaCy for NLP and machine learning☆193Updated 3 years ago
- Simple Python Statistical Parser☆111Updated 8 years ago
- Script that can scrape the transcripts of every speech a politician has given (as long as it's been recorded on whatthefolly.com)☆12Updated 8 years ago
- Adaptive crawler which uses Reinforcement Learning methods☆168Updated 2 weeks ago
- A python module to get the emotion of a word.☆75Updated 6 years ago
- A sentence segmenter that actually works!☆304Updated 5 years ago
- The Classical Language Toolkit☆884Updated last week
- Train word embeddings with Gensim and vizualize them with TensorBoard☆34Updated 6 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- 🗣️ Tool to generate adversarial text examples and test machine learning models against them☆401Updated 4 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Updated 4 months ago
- Extract countries, regions and cities from a URL or text☆217Updated 5 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆155Updated last year
- a collection of functions that measure the readability of a given body of text☆196Updated 8 years ago
- A library for sentiment analysis in dictionary framework.☆95Updated 6 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆316Updated 4 years ago
- The Yahoo News Annotated Comments Corpus (YNACC)☆19Updated 7 years ago
- Language detection extension for spaCy 2.0+☆114Updated 6 years ago