JoKnopp / dmoz2db
A database importer for the open directory project (aka dmoz) data
☆20Updated 10 years ago
Related projects ⓘ
Alternatives and complementary repositories for dmoz2db
- Dmoz RDF parser☆28Updated 8 years ago
- ☆224Updated 9 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 6 years ago
- Determine if a web comment is spam or not using naive Bayes. Trained on youtube comments.☆92Updated 12 years ago
- Whit is an open source SMS service, which allows you to query CrunchBase, Wikipedia, and several other data APIs.☆198Updated 11 years ago
- Data science tools from Moz☆22Updated 7 years ago
- Performs user classification into labels using a set of seed Twitter users with known labels and the structure of the interaction network…☆11Updated 7 years ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries☆20Updated 2 years ago
- Cross platform middleware for Social Networking Services: Twitter, Facebook, SinaWeibo, Renren, RSS, Email, Sqlite, ... (more coming)☆160Updated 2 years ago
- Extract opionion phrases from user reviews☆62Updated 10 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆34Updated 9 years ago
- Experiment on text summarization techniques and exploring Tensorflow.☆15Updated 7 years ago
- collection of modules to build distributed and reliable concurrent systems in Python.☆207Updated 11 years ago
- ☆41Updated 4 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.☆125Updated 11 years ago
- Feed discovery to share :)☆40Updated 8 years ago
- PredictionIO Complementary Purchase Engine Template (Scala-based parallelized engine)☆16Updated 5 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Toy question answering program. Aimed at "Who ....?" questions, e.g., "Who invented the C programming language?"☆39Updated 7 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 3 years ago
- Using Scrapy to get company profiles from http://crunchbase.com☆31Updated 11 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆29Updated 10 years ago
- Training a classifier to reddit's TIL to find new things on Wikipedia☆35Updated 9 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆23Updated 8 years ago
- Theano based deep ANN learning code☆38Updated 14 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 7 years ago
- Python implementation of cover trees, near-drop-in replacement for scipy.spatial.kdtree☆31Updated 12 years ago