ExpediaGroup/apiary

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ExpediaGroup/apiary)

ExpediaGroup / apiary

Apiary provides modules which can be combined to create a federated cloud data lake

☆38

Alternatives and similar repositories for apiary

Users that are interested in apiary are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ExpediaGroup / beekeeper
View on GitHub
Service for automatically managing and cleaning up unreferenced data
☆50Apr 24, 2026Updated 2 months ago
ExpediaGroup / circus-train
View on GitHub
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
☆93Mar 5, 2024Updated 2 years ago
ExpediaGroup / waggle-dance
View on GitHub
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
☆288Jun 25, 2026Updated 3 weeks ago
ExpediaGroup / stream-registry
View on GitHub
Stream Discovery and Stream Orchestration
☆124Jan 7, 2026Updated 6 months ago
openlookeng / hetu-odbc-driver
View on GitHub
☆13Jun 27, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
ExpediaGroup / datasqueeze
View on GitHub
Hadoop utility to compact small files
☆18Feb 16, 2026Updated 5 months ago
ExpediaGroup / shunting-yard
View on GitHub
Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.
☆20Oct 11, 2021Updated 4 years ago
ExpediaGroup / avro-compatibility
View on GitHub
A user friendly API for checking for and reporting on Avro schema incompatibilities.
☆61Mar 5, 2024Updated 2 years ago
chunkai1312 / mongoose-activitylog
View on GitHub
A mongoose plugin for logging activities
☆10Feb 1, 2023Updated 3 years ago
alexkago / cf-buildpack-r
View on GitHub
Heroku buildpack for R (http://www.r-project.org)
☆11Jul 6, 2015Updated 11 years ago
HiveRunner / mutant-swarm
View on GitHub
Mutation testing framework and code coverage for Hive SQL
☆24May 11, 2021Updated 5 years ago
jetbrains-infra / terraform-aws-vpc-with-private-subnets-and-nat
View on GitHub
VPC with public/private subnets with internet access over NAT
☆27Jul 7, 2018Updated 8 years ago
nreco / presto-ado
View on GitHub
ADO.NET Provider for Presto/Trino
☆13Oct 3, 2022Updated 3 years ago
oxia-db / oxia-client-java
View on GitHub
Oxia Java client SDK
☆21Jul 14, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SANSA-Stack / SANSA-DataLake
View on GitHub
A library to query heterogeneous data sources uniformly using SPARQL
☆12Dec 5, 2023Updated 2 years ago
emnify / jenkins-casc-docker
View on GitHub
Jenkins configuration as code docker image
☆10Nov 10, 2021Updated 4 years ago
big-data-europe / docker-hdfs-filebrowser
View on GitHub
A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.
☆11Sep 20, 2018Updated 7 years ago
jetbrains-infra / terraform-aws-spot-fleet
View on GitHub
AWS Spot fleet terraform module
☆11Apr 26, 2019Updated 7 years ago
leonLMR / presto-es
View on GitHub
presto's elasticsearch connector
☆11Dec 7, 2016Updated 9 years ago
ververica / flink-ecosystem
View on GitHub
Ecosystem website for Apache Flink
☆12Jan 22, 2024Updated 2 years ago
ververica / lab-sql-vs-datastream
View on GitHub
Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API
☆14Apr 15, 2020Updated 6 years ago
nineinchnick / trino-git
View on GitHub
A Trino connector to access git repository contents
☆17Feb 9, 2026Updated 5 months ago
dimajix / terraform-emr-training
View on GitHub
Terraform script for launching multiple EMR clusters for training purposes.
☆16Oct 30, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
msackman / gomdb
View on GitHub
Go wrapper for LMDB - OpenLDAP Lightning Memory-Mapped Database
☆11Feb 25, 2018Updated 8 years ago
thoughtworks / byor-voting-web-app
View on GitHub
☆14May 1, 2023Updated 3 years ago
SmartDataAnalytics / MA-INF-4223-DBDA-Lab
View on GitHub
Repository for Lab “Distributed Big Data Analytics” (MA-INF 4223), University of Bonn
☆10Aug 11, 2022Updated 3 years ago
rchukh / trino-querylog
View on GitHub
Trino plugin for logging query events into a separate log file.
☆40Nov 16, 2022Updated 3 years ago
PApostol / spark-submit
View on GitHub
Python manager for spark-submit jobs
☆10Jul 6, 2026Updated 2 weeks ago
inveniosoftware-contrib / citadel-search
View on GitHub
Citadel: Enterprise Search
☆15May 2, 2023Updated 3 years ago
superorbital / aws-eks-blueprint-examples
View on GitHub
An example Terraform repo that utilizes the upstream EKS blueprints project from AWS Integration and Automation.
☆14May 11, 2022Updated 4 years ago
aws-samples / aws-modernization-gitops-with-weaveworks
View on GitHub
☆11Jan 10, 2025Updated last year
noahgift / pass-any-aws-exam
View on GitHub
How to pass any
☆12Jun 7, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sundeck-io / qtag
View on GitHub
QTag: Turbocharge Your SQL Comments
☆12Jan 30, 2025Updated last year
aws-samples / aws-dog-no-dog
View on GitHub
Dog no Dog is a sample application to showcase how to build a serverless MVP with serverless technologies on AWS.
☆18Feb 19, 2022Updated 4 years ago
zircleUI / docs
View on GitHub
📚 The official zircle-UI documentation website.
☆12Aug 23, 2022Updated 3 years ago
lightbend / flink-k8s-operator
View on GitHub
An example of building kubernetes operator (Flink) using Abstract operator's framework
☆26Jul 12, 2019Updated 7 years ago
mvanderlee / aiotrino
View on GitHub
☆21Mar 21, 2025Updated last year
jurriaan / ruby-dacpclient
View on GitHub
A DACP (iTunes Remote protocol) client written in the wonderful Ruby language
☆23Nov 16, 2016Updated 9 years ago
rackspace-infrastructure-automation / aws-terraform-vpc_basenetwork
View on GitHub
☆19Jan 15, 2025Updated last year