JoKnopp / dmoz2db
A database importer for the open directory project (aka dmoz) data
☆20Updated 10 years ago
Alternatives and similar repositories for dmoz2db:
Users that are interested in dmoz2db are comparing it to the libraries listed below
- Dmoz RDF parser☆28Updated 8 years ago
- Determine if a web comment is spam or not using naive Bayes. Trained on youtube comments.☆92Updated 12 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 7 years ago
- Deprecated. Formerly: scripts to make it easier to set up and manipulate clusters at Amazon EC2☆111Updated 12 years ago
- A simple crawler in python☆25Updated 12 years ago
- Toy question answering program. Aimed at "Who ....?" questions, e.g., "Who invented the C programming language?"☆39Updated 8 years ago
- ☆224Updated 9 years ago
- iCQA - Intelligent Community Question Answering Framework☆32Updated 8 years ago
- A relativistic solution to Einstein's equations and a ray tracer for that solution☆13Updated 9 years ago
- Markov Bot based on bigram probabilities to generate tweets from your tweet history.☆21Updated 7 years ago
- Code and Presentation slides for Teaching the Elephant to Read☆17Updated 8 years ago
- Code for the CIFAR-10 competition at Kaggle, uses cuda-convnet☆44Updated 10 years ago
- Experiment on text summarization techniques and exploring Tensorflow.☆15Updated 7 years ago
- PredictionIO Complementary Purchase Engine Template (Scala-based parallelized engine)☆16Updated 5 years ago
- Content based Recommender System which implements sentiment analysis(Naive Bayes,SVMs) on Amazon product reviews. Built in Python(Beautif…☆10Updated 10 years ago
- Datasets and notebooks☆13Updated 8 years ago
- Web page segmentation and noise removal☆55Updated 11 months ago
- Data science tools from Moz☆22Updated 8 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 12 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 10 years ago
- An implementation of Dell Zhang's solution to Wikipedia's Participation Challenge on Kaggle☆11Updated 13 years ago
- Log-Bilinear Document Model☆18Updated 13 years ago
- Script to perform dictionary based n-gram text tagging efficiently in apache spark☆11Updated 8 years ago
- Public code files for the DDL blog☆56Updated 6 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated 7 months ago