HubSpot / hbase-support
Supporting configs and tools for HBase at HubSpot
☆17Updated 10 years ago
Related projects ⓘ
Alternatives and complementary repositories for hbase-support
- ☆76Updated 8 years ago
- Multidimensional data storage with rollups for numerical data☆265Updated 10 months ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- Cascading on Apache Flink®☆54Updated 9 months ago
- ByteBuffer collection classes for java and jvm-based languages.☆33Updated 6 years ago
- Annotation driven Java object writer for ORC with runtime code generation for speed.☆21Updated last year
- Sql interface to druid.☆77Updated 8 years ago
- A streaming key-value store implementation using native Flink Streaming operators☆22Updated 9 years ago
- XPath likeness for Avro☆35Updated last year
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Mirror of Apache DirectMemory☆53Updated 11 months ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated 8 months ago
- Hannibal is tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.☆172Updated 6 years ago
- A utility for generating Oozie workflows from a YAML definition☆48Updated 5 years ago
- A compiler for Pig Latin to Spark and Flink.☆23Updated 5 years ago
- Mirror of Apache Blur☆33Updated 5 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 8 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆12Updated last year
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Obsolete - superseded by Apache Calcite☆235Updated 3 years ago
- Quark is a data virtualization engine over analytic databases.☆99Updated 7 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 7 years ago
- Bitmap compression using the CONCISE algorithm☆43Updated 7 years ago
- Integration for Cascading and Apache Hive☆26Updated 7 years ago