facebookarchive / hive-io-experimentalLinks
Hive I/O Library
☆66Updated 3 years ago
Alternatives and similar repositories for hive-io-experimental
Users that are interested in hive-io-experimental are comparing it to the libraries listed below
Sorting:
- Elephant Twin is a framework for creating indexes in Hadoop☆97Updated 4 years ago
- Muppet☆127Updated 4 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 10 years ago
- Twitter's fork of Apache BookKeeper (will push changes upstream eventually)☆59Updated 6 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Netflix's In-Memory Data Propagation Framework☆200Updated last year
- Mirror of Apache Samza☆113Updated 11 months ago
- Integration for Cascading and Apache Hive☆25Updated 7 years ago
- Serving system for batch generated data sets☆177Updated 8 years ago
- Pig Visualization framework☆466Updated 2 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format☆127Updated 3 years ago
- Hadoop log aggregator and dashboard☆191Updated 11 years ago
- Jetstream is a streaming processing framework☆114Updated 10 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- PowerMock is a Java framework that allows you to unit test code normally regarded as untestable.☆32Updated 7 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 10 years ago
- ☆205Updated 2 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆76Updated 3 years ago
- JSR166e for Twitter☆28Updated 11 years ago
- All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…☆332Updated 6 years ago
- Aerospike Spark Connector☆35Updated 8 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆427Updated 9 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- Load balancer API☆112Updated last year
- A simple test of Avro 1.5 capabilities including dynamic typing, untagged (compact) data storage and schema evolution.☆36Updated 14 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Updated last year
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Apache Spark applications☆70Updated 7 years ago
- Graph Analytics Engine☆260Updated 11 years ago