airbnb / airpal
Web UI for PrestoDB.
☆2,762Updated 3 years ago
Related projects: ⓘ
- A machine learning package built for humans.☆4,797Updated 3 years ago
- Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules☆4,387Updated 2 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,214Updated this week
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,646Updated last year
- A high performance replicated log service. (The development is moved to Apache Incubator)☆2,224Updated 4 years ago
- Distributed Graph Database☆5,248Updated last year
- Apache Pinot - A realtime distributed OLAP datastore☆5,393Updated this week
- Pinball is a scalable workflow manager☆1,048Updated 4 years ago
- SQL-based streaming analytics platform at scale☆1,222Updated 4 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,039Updated last year
- An interactive data exploration UI for Druid☆646Updated 7 years ago
- REST job server for Apache Spark☆2,844Updated 2 months ago
- ☆881Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala☆31Updated last year
- The leader in Next-Generation Customer Data Infrastructure☆6,817Updated 2 weeks ago
- Realtime analytics, this includes the core components of Pulsar pipeline.☆654Updated 8 years ago
- Teletraan is Pinterest's deploy system.☆1,804Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,374Updated this week
- High-performance time-series aggregation for PostgreSQL☆2,633Updated 2 years ago
- Change data capture from PostgreSQL into Kafka☆2Updated last year
- Distributed Prometheus time series database☆1,428Updated this week
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,786Updated 3 years ago
- Secor is a service implementing Kafka log persistence☆1,844Updated 3 weeks ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Updated last year
- LinkedIn's previous generation Kafka to HDFS pipeline.☆882Updated 4 years ago
- Apache Drill is a distributed MPP query layer for self describing data☆1,928Updated 3 weeks ago
- ☆1,138Updated this week
- This page is a summary to keep the track of Hadoop related projects, and relevant projects around Big Data scene focused on the open sour…☆692Updated 3 years ago
- Streaming MapReduce with Scalding and Storm☆2,139Updated 2 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,104Updated last year