Kite SDK
☆393Nov 1, 2022Updated 3 years ago
Alternatives and similar repositories for kite
Users that are interested in kite are comparing it to the libraries listed below
Sorting:
- Kite SDK Examples☆99May 8, 2021Updated 4 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Feb 19, 2026Updated last week
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- ☆24Oct 19, 2015Updated 10 years ago
- Mirror of Apache Eagle☆411Aug 22, 2020Updated 5 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,133Apr 10, 2023Updated 2 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 3 months ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆517Jan 13, 2020Updated 6 years ago
- Mirror of Apache Slider☆78Dec 11, 2018Updated 7 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,605Updated this week
- Distributed version restore tool for S3☆12Jan 5, 2015Updated 11 years ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆2,038Updated this week
- source examples to support the "Cascading for the Impatient" blog post series☆80Aug 30, 2016Updated 9 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,371Aug 22, 2023Updated 2 years ago
- REST job server for Apache Spark☆2,843Jul 8, 2025Updated 7 months ago
- Mirror of Apache Crunch (Incubating)☆109Feb 2, 2021Updated 5 years ago
- Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.☆353Apr 8, 2025Updated 10 months ago
- Apache Fluo☆195Sep 24, 2025Updated 5 months ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Oct 1, 2022Updated 3 years ago
- Mirror of Apache Apex core☆350Jun 7, 2021Updated 4 years ago
- Apache Parquet Java☆3,025Feb 25, 2026Updated last week
- Fixed-width data source for Spark SQL and DataFrames☆10Oct 25, 2016Updated 9 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆280Aug 3, 2018Updated 7 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,113Jan 12, 2023Updated 3 years ago
- Serverless proxy for Spark cluster☆324Oct 29, 2020Updated 5 years ago
- Mirror of Apache Toree (Incubating)☆749Feb 21, 2026Updated last week
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- Command line tools for the parquet project☆44Jul 10, 2018Updated 7 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Generic Data Ingestion & Dispersal Library for Hadoop☆482Mar 19, 2023Updated 2 years ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 2 months ago
- BDD-style unit-level testing framework for Java/Scala/Groovy. Safely isolates mutable state. Unlimited nesting.☆20Dec 11, 2018Updated 7 years ago
- StreamLine - Streaming Analytics☆167Aug 27, 2023Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆242Mar 26, 2015Updated 10 years ago