rjurney / Collecting-DataLinks
This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.
☆31Updated 12 years ago
Alternatives and similar repositories for Collecting-Data
Users that are interested in Collecting-Data are comparing it to the libraries listed below
Sorting:
- ActiveColumn is a data management framework for Cassandra. It includes data migrations similar to ActiveRecord, and a data mapping frame…☆53Updated last year
- Command-line utilities for data analysis.☆18Updated 14 years ago
- Empower Curiosity / Redshift analytics platform☆77Updated 3 years ago
- Dynamic Visualization LEGO☆129Updated 10 months ago
- Bulk loading for elastic search☆185Updated last year
- Originally for monthly table partitions, more info at [imperialwicket.com](http://imperialwicket.com/postgresql-automating-monthly-table-…☆43Updated 9 years ago
- Sample code for Cascalog on Hadoop, a New Hope☆20Updated 11 years ago
- Example code for running R on Hadoop☆132Updated 12 years ago
- Realtime Analytics☆68Updated 12 years ago
- HBase adapters for Cascading☆46Updated 15 years ago
- Collaborative filtering with node, redis and lua☆13Updated 14 years ago
- A REST API for Mozilla Metrics services.☆57Updated 6 years ago
- It counts☆61Updated 12 years ago
- Ruby-based programmatic access to Amazon's Elastic MapReduce service.☆105Updated 4 years ago
- aggregate composite metrics for cassandra using counters☆15Updated 13 years ago
- Ruby interface to Hadoop's HDFS via Thrift☆50Updated 11 years ago
- ☆33Updated 6 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆174Updated 12 years ago
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- A Python wrapper for Cascading☆222Updated 5 years ago
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 10 years ago
- R driver for MongoDB☆82Updated 12 years ago
- A JRuby DSL for Cascading☆42Updated 9 years ago
- Luigi Plugin for Hubot☆36Updated 8 years ago
- Easy Map/Reduce with Hadoop and Ruby. Also see http://github.com/forward/mandy-lab for examples.☆45Updated 13 years ago
- A restful web application for real-time typeahead and autocomplete☆105Updated 12 years ago
- Dockerized spotify/luigi scheduler using nginx as proxy for the dashboard and mysql for task history database☆10Updated 7 years ago
- A very simple django site for demonstrating the use of rpy for plotting☆64Updated 10 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆583Updated 11 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Updated 11 years ago