browning / comment-troll-classifierLinks
Determine if a web comment is spam or not using naive Bayes. Trained on youtube comments.
☆92Updated 13 years ago
Alternatives and similar repositories for comment-troll-classifier
Users that are interested in comment-troll-classifier are comparing it to the libraries listed below
Sorting:
- POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation,…☆79Updated 11 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 7 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆141Updated 13 years ago
- Entry for the Third Annual GitHub Data Challenge☆35Updated 10 years ago
- Deprecated. Formerly: scripts to make it easier to set up and manipulate clusters at Amazon EC2☆110Updated 13 years ago
- Natural Language Generator for Python☆27Updated 8 years ago
- Topic modeling web application☆40Updated 10 years ago
- Slides to learn a little natural language processing (NLP) with Python. Written in reST with S5/Docutils.☆28Updated 13 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- SmallK: very fast data clustering tools☆14Updated 6 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Discover repositories you should be following on Github.☆31Updated 13 years ago
- ☆20Updated 8 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆54Updated 10 years ago
- Download *ALL* the submissions from Hacker News☆51Updated 11 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆30Updated 10 months ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 13 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- API server for TextBlob: Sentiment analysis, POS tagging, noun phrase extraction.☆24Updated 10 years ago
- Python utilities for detecting textual reuse☆21Updated 9 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.)☆29Updated 14 years ago
- MITIE: library and tools for information extraction☆29Updated 10 years ago
- Predicting closed questions on Stack Overflow☆44Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 6 years ago
- Scrapes public information off of LinkedIn☆111Updated 9 years ago
- Toy question answering program. Aimed at "Who ....?" questions, e.g., "Who invented the C programming language?"☆38Updated 8 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 4 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Bringing sanity to world of messed-up data☆66Updated 11 years ago