rjurney / Collecting-Data
This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.
☆31Updated 12 years ago
Alternatives and similar repositories for Collecting-Data:
Users that are interested in Collecting-Data are comparing it to the libraries listed below
- ActiveColumn is a data management framework for Cassandra. It includes data migrations similar to ActiveRecord, and a data mapping frame…☆53Updated last year
- Command-line utilities for data analysis.☆18Updated 14 years ago
- Realtime Analytics☆68Updated 12 years ago
- A Ruby client library for Scribe☆90Updated 14 years ago
- A JRuby DSL for Cascading☆42Updated 9 years ago
- Ruby interface to Hadoop's HDFS via Thrift☆50Updated 11 years ago
- Example code for running R on Hadoop☆133Updated 12 years ago
- HBase adapters for Cascading☆46Updated 15 years ago
- Ruby-based programmatic access to Amazon's Elastic MapReduce service.☆106Updated 4 years ago
- Bulk loading for elastic search☆185Updated last year
- ☆33Updated 6 years ago
- Dynamic Visualization LEGO☆129Updated 6 months ago
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- A restful web application for real-time typeahead and autocomplete☆105Updated 12 years ago
- Originally for monthly table partitions, more info at [imperialwicket.com](http://imperialwicket.com/postgresql-automating-monthly-table-…☆43Updated 9 years ago
- Sample code for Cascalog on Hadoop, a New Hope☆20Updated 11 years ago
- Hazelcast is a dropin replacement for the Play! Framework cache. Hazelcast provide also some other services like Clustered Executors, Map…☆22Updated 8 years ago
- Easy Map/Reduce with Hadoop and Ruby. Also see http://github.com/forward/mandy-lab for examples.☆45Updated 13 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- A very simple django site for demonstrating the use of rpy for plotting☆64Updated 10 years ago
- Round robin database pattern via Redis sorted sets☆79Updated 14 years ago
- A plugin for flume that allows you to use Cassandra as a sink.☆59Updated 13 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Updated 10 years ago
- Piglet is a DSL for writing Pig scripts in Ruby☆84Updated 14 years ago
- Collaborative filtering with node, redis and lua☆13Updated 13 years ago
- Tool to help users migrate large relational databases into Hadoop clusters.☆67Updated 12 years ago
- Empower Curiosity / Redshift analytics platform☆77Updated 3 years ago
- A REST API for Mozilla Metrics services.☆57Updated 5 years ago
- Using Hadoop by Ruby script, supported by JRuby. Not Hadoop streaming.☆56Updated 14 years ago
- Example Repository for Building Complex Data Pipeline with Luigi +TD☆24Updated 9 years ago