Kite SDK
☆394Nov 1, 2022Updated 3 years ago
Alternatives and similar repositories for kite
Users that are interested in kite are comparing it to the libraries listed below.
- Kite SDK Examples☆99May 8, 2021Updated 5 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,266May 15, 2026Updated last week
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Aug 16, 2021Updated 4 years ago
- Mirror of Apache Eagle☆411Aug 22, 2020Updated 5 years ago
- Fixed-width data source for Spark SQL and DataFrames☆10Oct 25, 2016Updated 9 years ago
- Mirror of Apache Crunch (Incubating)☆110Feb 2, 2021Updated 5 years ago
- Command line tools for the parquet project☆44Jul 10, 2018Updated 7 years ago
- ☆24Oct 19, 2015Updated 10 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,034Nov 21, 2022Updated 3 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,621Updated this week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Oct 5, 2022Updated 3 years ago
- Apache Parquet Java☆3,058May 18, 2026Updated last week
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 6 months ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,369Aug 22, 2023Updated 2 years ago
- Mirror of Apache Slider☆79Dec 11, 2018Updated 7 years ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆1,977May 12, 2026Updated 2 weeks ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆520Jan 13, 2020Updated 6 years ago
- A small library to add some convenience methods to Scala encompassing predicate logic☆21Mar 16, 2016Updated 10 years ago
- Low level integration of Spark and Kafka☆131Mar 15, 2018Updated 8 years ago
- REST job server for Apache Spark☆2,843Mar 3, 2026Updated 2 months ago
- Distributed version restore tool for S3☆12Jan 5, 2015Updated 11 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,112Jan 12, 2023Updated 3 years ago
- Base classes to use when writing tests with Spark☆1,554Apr 20, 2026Updated last month
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,559May 15, 2026Updated last week
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆282Feb 27, 2019Updated 7 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Mar 26, 2015Updated 11 years ago
- Mirror of Apache DataFu☆124May 15, 2026Updated last week
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,195Apr 29, 2025Updated last year
- source examples to support the "Cascading for the Impatient" blog post series☆79Aug 30, 2016Updated 9 years ago
- Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.☆354Apr 8, 2025Updated last year
- Streaming MapReduce with Scalding and Storm☆2,126Jan 19, 2022Updated 4 years ago
- ☆14Jan 12, 2017Updated 9 years ago
- CMAK is a tool for managing Apache Kafka clusters☆11,934Aug 2, 2023Updated 2 years ago
- Serverless proxy for Spark cluster☆325Apr 13, 2026Updated last month
- Mirror of Apache Toree (Incubating)☆750May 15, 2026Updated last week