rjurney / Collecting-Data
This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.
☆31 Updated 12 years ago
Alternatives and similar repositories for Collecting-Data:
Users who are interested in Collecting-Data are comparing it to the libraries listed below.
- ActiveColumn is a data management framework for Cassandra. It includes data migrations similar to ActiveRecord, and a data mapping frame… ☆53 Updated last year
- A JRuby DSL for Cascading ☆42 Updated 9 years ago
- Originally for monthly table partitions, more info at [imperialwicket.com](http://imperialwicket.com/postgresql-automating-monthly-table-… ☆43 Updated 9 years ago
- Bulk loading for elastic search ☆185 Updated last year
- Easy Map/Reduce with Hadoop and Ruby. Also see http://github.com/forward/mandy-lab for examples. ☆45 Updated 13 years ago
- Example code for running R on Hadoop ☆133 Updated 12 years ago
- Round robin database pattern via Redis sorted sets ☆79 Updated 14 years ago
- Ruby-based programmatic access to Amazon's Elastic MapReduce service. ☆106 Updated 4 years ago
- Piglet is a DSL for writing Pig scripts in Ruby ☆84 Updated 14 years ago
- Empower Curiosity / Redshift analytics platform ☆77 Updated 3 years ago
- A restful web application for real-time typeahead and autocomplete ☆105 Updated 12 years ago
- Tool to help users migrate large relational databases into Hadoop clusters. ☆67 Updated 12 years ago
- Dynamic Visualization LEGO ☆129 Updated 6 months ago
- ☆33 Updated 6 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka ☆34 Updated 9 years ago
- Realtime Analytics ☆68 Updated 12 years ago
- Collaborative filtering with node, redis and lua ☆13 Updated 13 years ago
- juttle execution engine ☆37 Updated 8 years ago
- It counts ☆61 Updated 12 years ago
- Ruby interface to Hadoop's HDFS via Thrift ☆50 Updated 11 years ago
- A REST API for Mozilla Metrics services. ☆57 Updated 5 years ago
- A Seriously Fun guide to Big Data Analytics in Practice ☆169 Updated 9 years ago
- A plugin for flume that allows you to use Cassandra as a sink. ☆59 Updated 13 years ago
- Aggregating NBA data ☆31 Updated 11 years ago
- On demand presto cluster with mesos, marathon and docker. ☆30 Updated 7 years ago
- Presto connector to Amazon Kinesis service. ☆14 Updated 5 years ago
- aggregate composite metrics for cassandra using counters ☆16 Updated 13 years ago
- Sample code for Cascalog on Hadoop, a New Hope ☆20 Updated 11 years ago
- A sample application that consumes from twitter using HBC and producing into Amazon Kinesis ☆12 Updated 9 years ago
- Using Hadoop by Ruby script, supported by JRuby. Not Hadoop streaming. ☆56 Updated 14 years ago