echen / data-hacksLinks
Command-line utilities for data analysis.
☆18Updated 14 years ago
Alternatives and similar repositories for data-hacks
Users that are interested in data-hacks are comparing it to the libraries listed below
Sorting:
- HBase adapters for Cascading☆46Updated 15 years ago
- It counts☆61Updated 12 years ago
- ☆33Updated 6 years ago
- aggregate composite metrics for cassandra using counters☆15Updated 13 years ago
- Realtime Analytics☆41Updated 13 years ago
- This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.☆31Updated 12 years ago
- Redesign to eliminate all string identifiers and hide partitioning details from app developer.☆16Updated 13 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago
- Realtime Analytics☆68Updated 12 years ago
- A very memory-efficient trie (radix tree) implementation☆47Updated 13 years ago
- Store batched Kafka messages in S3.☆40Updated 3 years ago
- Simple Samza Job Using Confluent Platform☆14Updated 9 years ago
- Patched version of Cloudera's Distribution of Hadoop with Mesos support☆13Updated 13 years ago
- An integration framework that allows you to run and manage CrateDB via Apache Mesos.☆23Updated 6 years ago
- Presto connector to Amazon Kinesis service.☆14Updated 6 years ago
- Collaborative filtering with node, redis and lua☆13Updated 14 years ago
- A toy school project intended to be an approximate clone of Google's Megastore database for geographically-distributed scalable fault-to…☆35Updated 13 years ago
- DDSL - Dynamic Distributed Service Locator☆101Updated 9 years ago
- recordbus: mysql binlog to apache kafka☆80Updated 9 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Utilities for dealing with Apache Zookeeper☆41Updated 12 years ago
- Allows a Storm topology to consume an AMQP exchange as an input source.☆54Updated 12 years ago
- This project allows to run Samza jobs on Mesos cluster☆43Updated 4 years ago
- Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka☆20Updated 9 years ago
- Hive Storage Handler for Kinesis.☆11Updated 10 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Updated 11 years ago
- Sample code for Cascalog on Hadoop, a New Hope☆20Updated 11 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- A JVM Kestrel client that aggregates queues from multiple servers. Implemented in Scala with Java bindings. In use at Twitter for all JVM…☆56Updated 8 years ago
- Lucene based indexing in Cassandra☆61Updated 9 years ago