Apiary provides modules which can be combined to create a federated cloud data lake
☆37Apr 3, 2024Updated last year
Alternatives and similar repositories for apiary
Users that are interested in apiary are comparing it to the libraries listed below
Sorting:
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆92Mar 5, 2024Updated 2 years ago
- A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service☆13Updated this week
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆284Feb 24, 2026Updated 3 weeks ago
- Insights Explorer is a tool to catalogue and present analytical & research work.☆13Nov 26, 2024Updated last year
- Stream Discovery and Stream Orchestration☆122Jan 7, 2026Updated 2 months ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated last month
- ☆13Jun 27, 2023Updated 2 years ago
- giter8 template for Spark Jobserver☆12Jan 19, 2018Updated 8 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Oct 11, 2021Updated 4 years ago
- A user friendly API for checking for and reporting on Avro schema incompatibilities.☆59Mar 5, 2024Updated 2 years ago
- Heroku buildpack for R (http://www.r-project.org)☆11Jul 6, 2015Updated 10 years ago
- deck.gl plugins for Superset☆19Dec 10, 2021Updated 4 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Mar 5, 2024Updated 2 years ago
- ADO.NET Provider for Presto/Trino☆13Oct 3, 2022Updated 3 years ago
- A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.☆11Sep 20, 2018Updated 7 years ago
- Use an Arduino with with USB HID support to control a project in Git☆13Jan 3, 2012Updated 14 years ago
- Entity-level code review for Git. Graph-based risk scoring, change classification, commit untangling. 95% recall on the Greptile benchmar…☆53Mar 13, 2026Updated last week
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 5 years ago
- Ecosystem website for Apache Flink☆12Jan 22, 2024Updated 2 years ago
- A Trino connector to access git repository contents☆18Feb 9, 2026Updated last month
- A Trino ODBC driver☆14Jan 10, 2024Updated 2 years ago
- Pulsar consumer clients offering priority consumption☆12Mar 17, 2023Updated 3 years ago
- ☆21Mar 21, 2025Updated last year
- Go wrapper for LMDB - OpenLDAP Lightning Memory-Mapped Database☆11Feb 25, 2018Updated 8 years ago
- Proxy for S3☆18Feb 13, 2026Updated last month
- ☆15May 1, 2023Updated 2 years ago
- Trino plugin for logging query events into a separate log file.☆40Nov 16, 2022Updated 3 years ago
- Beautiful Repository apache2☆11Oct 13, 2020Updated 5 years ago
- Citadel: Enterprise Search☆15May 2, 2023Updated 2 years ago
- ☆11Jan 10, 2025Updated last year
- 📚 The official zircle-UI documentation website.☆12Aug 23, 2022Updated 3 years ago
- QTag: Turbocharge Your SQL Comments☆12Jan 30, 2025Updated last year
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- Dog no Dog is a sample application to showcase how to build a serverless MVP with serverless technologies on AWS.☆18Feb 19, 2022Updated 4 years ago
- OctArranger is an unofficial, decently sized graphical interface that allows creation and edition of arrangements for the Elektron Octatr…☆14Apr 21, 2020Updated 5 years ago
- Swift library for Bluetooth Web API (WebAssembly)☆18Jul 11, 2025Updated 8 months ago
- Apache Flink demo example☆17Jan 10, 2019Updated 7 years ago
- Pterradactyl is a library developed to abstract Terraform configuration from the Terraform environment setup.☆38Jan 8, 2026Updated 2 months ago
- Fast, simple Free Monads using ScalaMeta macro annotations. Port of Freasy-Monad.☆14Oct 16, 2017Updated 8 years ago