petewarden / crunchcrawl
A project to gather, analyze, and visualize the data in Crunchbase
☆46Updated 13 years ago
Alternatives and similar repositories for crunchcrawl
Users who are interested in crunchcrawl are comparing it to the repositories listed below
- Jeremy's Machine Learning Library☆52Updated 9 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆174Updated 12 years ago
- Pretty fast parser for probabilistic context free grammars☆87Updated 12 years ago
- Fast and intuitive exploratory data analysis☆96Updated 9 years ago
- A RESTful web application for real-time typeahead and autocomplete☆105Updated 12 years ago
- Realtime Analytics☆68Updated 12 years ago
- Natural language processing with link-grammar☆18Updated 15 years ago
- A GPU Database☆146Updated 7 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- The reference implementation of the SPEAR ranking algorithm in Python.☆37Updated 9 years ago
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- Cloud9 is a Hadoop toolkit for working with big data☆237Updated 9 years ago
- A simple system for archiving and OCRing documents built for cloud-friendly search and backup.☆22Updated 4 years ago
- GitHub contest☆40Updated 15 years ago
- Python-based utility for managing various distributed services on cloud providers☆63Updated 11 years ago
- MongoDB-powered web search index☆18Updated 13 years ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- Implementation of the FriendFeed Schema-less MySQL Pattern☆87Updated 14 years ago
- Bulk loading for Elasticsearch☆185Updated last year
- A platform for real-time streaming search☆102Updated 9 years ago
- A command-line Twitter client with smart filtering and statistical classification☆165Updated 14 years ago
- Data visualization for your database.☆83Updated 10 years ago
- A collection of datasets and databases☆24Updated 7 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- Social Graph Analysis using Elastic MapReduce and PyPy☆55Updated 14 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- A scrapy-based Hacker News crawler.☆151Updated 12 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- A Python wrapper for Cascading☆222Updated 5 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago