verisign / trumpet
HA, fault-tolerant, non-intrusive INotify for Hadoop HDFS
☆18Updated last year
Alternatives and similar repositories for trumpet:
Users that are interested in trumpet are comparing it to the libraries listed below
- Hive + Avro. Serde for working with Avro in Hive☆59Updated last year
- Cascading on Apache Flink®☆54Updated 11 months ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…☆100Updated 11 years ago
- Use Avro to store all your values in HBase instead of regular columns☆75Updated 7 years ago
- Utility to easily copy files into HDFS☆69Updated 4 years ago
- Open source framework for predictive modeling on Apache Hadoop☆34Updated 10 years ago
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.☆91Updated 11 years ago
- Sample Spark Streaming application for secure consumption from Kafka☆33Updated 7 years ago
- Kafka Partitions Assignment Optimizer☆17Updated 8 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆47Updated 5 years ago
- A library to expose more of Apache Spark's metrics system☆146Updated 5 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Continuous Streaming SQL Queries for Flume☆95Updated 13 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆34Updated 11 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆50Updated 13 years ago
- Ambari Service definition for a Jupyter (IPython3) Notebook service☆42Updated 8 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distributed Filesystem (HDFS) simpler and more intu…☆36Updated 8 months ago
- Kafka as Hive Storage☆66Updated 10 years ago
- Using Hadoop with Scala☆71Updated 11 years ago
- Library to use Kestrel as a spout within Storm☆134Updated 7 years ago
- Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.☆348Updated 7 months ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 8 years ago
- HDFS-compatible distributed filesystem backed by Cassandra☆25Updated 9 years ago
- iSAX Indexing persisted in HBase☆39Updated 13 years ago
- Examples of using the Pig scripting language's capabilities☆39Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 7 years ago
- Kite SDK Examples☆99Updated 3 years ago
- functionstest☆33Updated 8 years ago