wikimedia / analytics-kafkateeLinks
Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing
☆21Updated 2 years ago
Alternatives and similar repositories for analytics-kafkatee
Users that are interested in analytics-kafkatee are comparing it to the libraries listed below
Sorting:
- ☆77Updated 9 years ago
- developer repository for https://github.com/fuse-kafka/fuse_kafka☆27Updated 10 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆84Updated 3 years ago
- recordbus: mysql binlog to apache kafka☆80Updated 10 years ago
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines☆132Updated 7 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 6 years ago
- Schema and type system for creating sortable byte[]☆47Updated 13 years ago
- Time series analysis with Apache Spark based on Chronix |☆38Updated 8 years ago
- ☆45Updated 4 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆141Updated 8 years ago
- The Chronix Server implementation that is based on Apache Solr.☆266Updated 6 years ago
- A distributed queue built off cassandra☆51Updated 9 years ago
- ☆49Updated 8 years ago
- Atomix Jepsen tests☆14Updated 9 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud …☆116Updated 10 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆85Updated 7 months ago
- Tools for working with parquet, impala, and hive☆134Updated 5 years ago
- The Chronix storage based on Apache Lucene☆47Updated 8 years ago
- Last-seen sketch implementation in Go☆16Updated 5 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 9 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 6 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 3 years ago
- Vagrant, Apache Spark and Apache Zeppelin VM for teaching☆44Updated 8 years ago
- Producer daemon for Apache Kafka☆71Updated last year
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 3 years ago
- Storm on Mesos!☆138Updated 4 years ago
- Mozilla Services Data Pipeline☆30Updated 6 years ago
- Lucene based indexing in Cassandra☆62Updated 9 years ago