echen / data-hacks
Command-line utilities for data analysis.
☆18 Updated 14 years ago
Alternatives and similar repositories for data-hacks:
Users who are interested in data-hacks compare it to the repositories listed below:
- HBase adapters for Cascading ☆46 Updated 15 years ago
- ☆33 Updated 6 years ago
- Sample code for Cascalog on Hadoop, a New Hope ☆20 Updated 11 years ago
- It counts ☆61 Updated 12 years ago
- This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka. ☆31 Updated 12 years ago
- DDSL - Dynamic Distributed Service Locator ☆102 Updated 9 years ago
- Patched version of Cloudera's Distribution of Hadoop with Mesos support ☆13 Updated 13 years ago
- aggregate composite metrics for cassandra using counters ☆16 Updated 13 years ago
- Realtime Analytics ☆68 Updated 12 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc. ☆76 Updated 10 years ago
- A REST API for Mozilla Metrics services. ☆57 Updated 5 years ago
- Experiments in Streaming ☆60 Updated 8 years ago
- Allows a Storm topology to consume an AMQP exchange as an input source. ☆54 Updated 12 years ago
- A small Scala library for writing specs as simple classes and methods (no longer maintained). ☆38 Updated 7 years ago
- Redesign to eliminate all string identifiers and hide partitioning details from app developer. ☆16 Updated 13 years ago
- Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka ☆20 Updated 9 years ago
- This project allows running Samza jobs on a Mesos cluster ☆43 Updated 3 years ago
- A very memory-efficient trie (radix tree) implementation ☆47 Updated 12 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 7 years ago
- Protobuf support for Finagle ☆14 Updated 2 years ago
- HDFS compatible Distributed Filesystem backed by Cassandra ☆25 Updated 9 years ago
- Hadoop Cluster Management with Intelligent Defaults ☆40 Updated 10 years ago
- Mirror of Apache MRUnit ☆38 Updated 6 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage ☆14 Updated 9 years ago
- recordbus: mysql binlog to apache kafka ☆80 Updated 9 years ago
- Android Live information coming from Twitter ☆35 Updated 11 years ago
- Tuple MapReduce for Hadoop: Hadoop API made easy ☆57 Updated 2 years ago
- A JVM Kestrel client that aggregates queues from multiple servers. Implemented in Scala with Java bindings. In use at Twitter for all JVM… ☆56 Updated 7 years ago
- Examples of using pallet ☆29 Updated 12 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 Updated 2 years ago