datasalt / pangoolLinks
Tuple MapReduce for Hadoop: Hadoop API made easy
☆56Updated 3 years ago
Alternatives and similar repositories for pangool
Users that are interested in pangool are comparing it to the libraries listed below
Sorting:
- It counts☆61Updated 12 years ago
- (DEPRECATED. This project is no longer used or maintained at LiveRamp.) Hank is a high performance distributed key-value NoSQL database t…☆174Updated 5 years ago
- Ordasity is Boundary's library for building stateful clustered services on the JVM.☆345Updated 2 years ago
- Tool to help users migrate large relational databases into Hadoop clusters.☆66Updated 13 years ago
- OOM diagnostics for Java.☆22Updated 13 years ago
- Utilities for dealing with Apache Zookeeper☆42Updated 12 years ago
- A REST API for Mozilla Metrics services.☆57Updated 6 years ago
- ☆33Updated 6 years ago
- High performance, memory-limited adaptive histogram class.☆47Updated 6 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆173Updated 13 years ago
- DDSL - Dynamic Distributed Service Locator☆102Updated 10 years ago
- A lightweight platform monitoring tool for Java VMs☆156Updated 8 years ago
- Toolkit of simple scripts useful for managing Hadoop☆17Updated 14 years ago
- Distributed database specialized in exporting key/value data from Hadoop☆558Updated 11 years ago
- iSAX Indexing persisted in HBase☆39Updated 14 years ago
- A variable length record, checksumming, append only rotating log implementation with graceful recovery☆14Updated 13 years ago
- HBase adapters for Cascading☆46Updated 16 years ago
- Continuous Streaming SQL Queries for Flume☆95Updated 13 years ago
- Examples of using Akka and 0MQ in Java, separately and together.☆50Updated 14 years ago
- Halflife is now a part of Reactor, and is available under Reactor Pipe☆16Updated 9 years ago
- A Lazy Data Flow Framework (no longer active - see Apache TinkerPop)☆278Updated 4 years ago
- A restful web application for real-time typeahead and autocomplete☆105Updated 12 years ago
- ☆29Updated 7 years ago
- Realtime Analytics☆68Updated 12 years ago
- S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…☆234Updated 14 years ago
- Some utilities for Lucene☆111Updated 12 years ago
- Mirror of Apache Whirr☆94Updated 8 years ago
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.☆90Updated 12 years ago
- A distributed task queue worker designed for throughput, parallelism, and clustering.☆238Updated 2 years ago
- Simple bash functions for manipulating Amazon Elastic MapReduce clusters☆45Updated 9 years ago