Apiary provides modules which can be combined to create a federated cloud data lake
☆37Apr 3, 2024Updated 2 years ago
Alternatives and similar repositories for apiary
Users that are interested in apiary are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Service for automatically managing and cleaning up unreferenced data☆50Updated this week
- Terraform scripts for deploying Apiary Data Lake☆19Apr 16, 2026Updated 2 weeks ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆93Mar 5, 2024Updated 2 years ago
- A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service☆13Mar 26, 2026Updated last month
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆286Feb 24, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Insights Explorer is a tool to catalogue and present analytical & research work.☆13Nov 26, 2024Updated last year
- Stream Discovery and Stream Orchestration☆123Jan 7, 2026Updated 3 months ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated 2 months ago
- ☆13Jun 27, 2023Updated 2 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Oct 11, 2021Updated 4 years ago
- A user friendly API for checking for and reporting on Avro schema incompatibilities.☆59Mar 5, 2024Updated 2 years ago
- A mongoose plugin for logging activities☆10Feb 1, 2023Updated 3 years ago
- Mutation testing framework and code coverage for Hive SQL☆24May 11, 2021Updated 4 years ago
- deck.gl plugins for Superset☆19Dec 10, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Mar 5, 2024Updated 2 years ago
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago
- Oxia Java client SDK☆19Apr 17, 2026Updated last week
- Use an Arduino with with USB HID support to control a project in Git☆13Jan 3, 2012Updated 14 years ago
- Discogs Wantlist Monitor saves manually searching through your wantlist for local listings☆12Apr 1, 2026Updated 3 weeks ago
- presto's elasticsearch connector☆11Dec 7, 2016Updated 9 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 6 years ago
- Ecosystem website for Apache Flink☆12Jan 22, 2024Updated 2 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Oct 30, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Trino ODBC driver☆14Jan 10, 2024Updated 2 years ago
- Pulsar consumer clients offering priority consumption☆12Mar 17, 2023Updated 3 years ago
- ☆21Mar 21, 2025Updated last year
- Go wrapper for LMDB - OpenLDAP Lightning Memory-Mapped Database☆11Feb 25, 2018Updated 8 years ago
- Proxy for S3☆19Apr 15, 2026Updated 2 weeks ago
- Temporary repository for implementing tensor factorization algorithms on Apache Spark☆13Nov 27, 2017Updated 8 years ago
- Repository for Lab “Distributed Big Data Analytics” (MA-INF 4223), University of Bonn☆10Aug 11, 2022Updated 3 years ago
- Semantic Web library and tool for retrieval and deployment of data from/to GIT, CKAN, MAVEN repos and triple stores using DCAT as the bac…☆13Mar 12, 2024Updated 2 years ago
- Fybrik platform - Arrow/Flight module☆15Aug 10, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Arduino library for the Microchip MCP4261☆19Jan 6, 2022Updated 4 years ago
- An example Terraform repo that utilizes the upstream EKS blueprints project from AWS Integration and Automation.☆14May 11, 2022Updated 3 years ago
- Python manager for spark-submit jobs☆10Jan 6, 2024Updated 2 years ago
- Citadel: Enterprise Search☆15May 2, 2023Updated 2 years ago
- 📚 The official zircle-UI documentation website.☆12Aug 23, 2022Updated 3 years ago
- QTag: Turbocharge Your SQL Comments☆12Jan 30, 2025Updated last year
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago