browning / comment-troll-classifier
Determine if a web comment is spam or not using naive Bayes. Trained on youtube comments.
☆92Updated 12 years ago
Related projects: ⓘ
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆54Updated 9 years ago
- Slides to learn a little natural language processing (NLP) with Python. Written in reST with S5/Docutils.☆28Updated 11 years ago
- Natural Language Generator for Python☆27Updated 7 years ago
- Predicting closed questions on Stack Overflow☆46Updated 6 years ago
- OpenTC is a text classification engine using several algorithms in machine learning☆26Updated 4 years ago
- General Architecture for Text Engineering☆45Updated 8 years ago
- POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation,…☆78Updated 10 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated 3 months ago
- Public Machine Learning and Data Competition Repo☆54Updated 8 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 6 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 3 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 2 years ago
- Datasets and notebooks☆13Updated 7 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Updated 10 years ago
- Topic modeling web application☆39Updated 9 years ago
- A Python module to fetch and parse results from different search engines.☆77Updated 5 years ago
- (BROKEN, help wanted)☆15Updated 8 years ago
- Data science tools from Moz☆22Updated 7 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- Content based Recommender System which implements sentiment analysis(Naive Bayes,SVMs) on Amazon product reviews. Built in Python(Beautif…☆10Updated 9 years ago
- Temporal Anomaly Detector (TAD)☆14Updated 6 years ago
- ☆34Updated this week
- ☆22Updated 9 years ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.☆50Updated 9 years ago
- Similarity search on Wikipedia using gensim in Python.☆61Updated 5 years ago
- Facebook Crawler - Crawl information from facebook☆42Updated 11 years ago
- SmallK: very fast data clustering tools☆14Updated 5 years ago
- Generating the next read for our book club- with Data Science!☆40Updated 8 years ago