facebookarchive / hive-io-experimentalLinks
Hive I/O Library
☆66Updated 3 years ago
Alternatives and similar repositories for hive-io-experimental
Users that are interested in hive-io-experimental are comparing it to the libraries listed below
Sorting:
- Muppet☆127Updated 4 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 10 years ago
- Integration for Cascading and Apache Hive☆25Updated 7 years ago
- Elephant Twin is a framework for creating indexes in Hadoop☆97Updated 4 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format☆127Updated 3 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- PowerMock is a Java framework that allows you to unit test code normally regarded as untestable.☆32Updated 7 years ago
- Hadoop log aggregator and dashboard☆191Updated 11 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆76Updated 3 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Aerospike Spark Connector☆35Updated 8 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 12 years ago
- Mirror of Apache Samza☆113Updated 11 months ago
- A scala dsl for dataflow☆11Updated 10 years ago
- Scala client for the Lightning data visualization server (WIP)☆47Updated 6 years ago
- Graph Analytics Engine☆260Updated 10 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Updated last year
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 10 years ago
- A framework called "pinspider" on Apache mesos, to get basic user information from a pinterest page of a user.☆18Updated 10 years ago
- ☆205Updated 2 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Updated 11 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Oryx 2 (incubating): Lambda architecture on Spark for real-time large scale machine learning☆14Updated 4 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop☆244Updated 10 years ago
- API Hub is a web UI for browsing and searching a catalog of Rest.li APIs.☆74Updated 6 years ago
- Templates for projects based on top of H2O.☆38Updated 5 months ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆427Updated 9 years ago