ExpediaGroup/shunting-yard

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ExpediaGroup/shunting-yard)

ExpediaGroup / shunting-yard

Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.

☆20

Alternatives and similar repositories for shunting-yard

Users that are interested in shunting-yard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ExpediaGroup / circus-train
View on GitHub
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
☆93Mar 5, 2024Updated 2 years ago
ExpediaGroup / beekeeper
View on GitHub
Service for automatically managing and cleaning up unreferenced data
☆50Apr 24, 2026Updated 3 months ago
HiveRunner / mutant-swarm
View on GitHub
Mutation testing framework and code coverage for Hive SQL
☆24May 11, 2021Updated 5 years ago
ExpediaGroup / beeju
View on GitHub
JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs
☆26Jul 22, 2025Updated last year
openaire / vipe
View on GitHub
Tool for visualizing Apache Oozie pipelines
☆13Feb 15, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ExpediaGroup / drone-fly
View on GitHub
A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service
☆13Jun 30, 2026Updated 3 weeks ago
NitinSPatil15 / Project-3-Data-Warehouse-with-AWS
View on GitHub
An ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables
☆16May 5, 2020Updated 6 years ago
Pathairush / rdbms_to_hdfs_data_pipeline
View on GitHub
A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).
☆15Jun 3, 2021Updated 5 years ago
nreco / presto-ado
View on GitHub
ADO.NET Provider for Presto/Trino
☆13Oct 3, 2022Updated 3 years ago
Namek / TheConsole
View on GitHub
JavaScriptable shell
☆15Oct 15, 2016Updated 9 years ago
10xfuturetechnologies / kafka-connect-iceberg
View on GitHub
Kafka Connector for Iceberg tables
☆16Jul 24, 2023Updated 3 years ago
ExpediaGroup / apiary
View on GitHub
Apiary provides modules which can be combined to create a federated cloud data lake
☆38Apr 3, 2024Updated 2 years ago
skilld-labs / trino-odbc
View on GitHub
A Trino ODBC driver
☆15Jan 10, 2024Updated 2 years ago
yhyyz / kafka-cdc-redshift
View on GitHub
kafka-cdc-redshift
☆13Jul 2, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cloudera-labs / hms-mirror
View on GitHub
"hms-mirror" is a utility used to bridge the gap between two clusters and migrate hive metadata.
☆18Nov 8, 2025Updated 8 months ago
leonLMR / presto-es
View on GitHub
presto's elasticsearch connector
☆11Dec 7, 2016Updated 9 years ago
ExpediaGroup / jasvorno
View on GitHub
A library for strong, schema based conversion between 'natural' JSON documents and Avro
☆18Mar 5, 2024Updated 2 years ago
nineinchnick / trino-git
View on GitHub
A Trino connector to access git repository contents
☆17Feb 9, 2026Updated 5 months ago
timveil / docker-hadoop
View on GitHub
Simple functional examples of running Hadoop + Hive in Docker with Docker Compose
☆24Dec 25, 2022Updated 3 years ago
yaooqinn / spark-authorizer
View on GitHub
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆183Apr 6, 2022Updated 4 years ago
homeaway / datapull
View on GitHub
Cloud based Data Platform based on Apache Spark
☆28Jun 30, 2026Updated 3 weeks ago
ggear / cloudera-framework
View on GitHub
☆11Feb 14, 2020Updated 6 years ago
spotify / hadoop-openpgp-codec
View on GitHub
Codec for Hadoop adding OpenPGP encryption using Bouncy Castle
☆17Aug 18, 2011Updated 14 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
HiveRunner / HiveRunner
View on GitHub
An Open Source unit test framework for Hive queries based on JUnit 4 and 5
☆262Jan 6, 2025Updated last year
albertovpd / automated_etl_google_cloud-social_dashboard
View on GitHub
A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315
☆28Oct 30, 2021Updated 4 years ago
ExpediaGroup / javro
View on GitHub
JSON Schema to Avro Mapper
☆28Mar 5, 2024Updated 2 years ago
codingforentrepreneurs / Serverless-Python-Workflow-with-AWS-Lambda
View on GitHub
A tutorial to setup and deploy a simple Serverless Python workflow with REST API endpoints in AWS Lambda.
☆22Apr 22, 2020Updated 6 years ago
BlueGranite / tpc-ds-dataset-generator
View on GitHub
Generate big TPC-DS datasets with Databricks
☆21Jan 3, 2022Updated 4 years ago
CoxAutomotiveDataSolutions / spark-distcp
View on GitHub
A re-implementation of Hadoop DistCP in Apache Spark
☆47Dec 20, 2023Updated 2 years ago
ExpediaGroup / rhapsody
View on GitHub
Reactive Streams framework with support for at-least-once processing
☆32Mar 7, 2024Updated 2 years ago
ExpediaGroup / heat
View on GitHub
Heat Test Framework
☆47Mar 5, 2024Updated 2 years ago
Natural-Intelligence / openLineage-openMetadata-transporter
View on GitHub
Transporter for integrating OpenLineage with OpenMetadata
☆18Sep 10, 2025Updated 10 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
mvanderlee / aiotrino
View on GitHub
☆21Mar 21, 2025Updated last year
trinodb / aws-proxy
View on GitHub
Proxy for S3
☆20Updated this week
TonyTromp / HevyConnect
View on GitHub
A Hevy to Garmin Connect file converter (and FIT file viewer).
☆17Dec 15, 2025Updated 7 months ago
StyraOSS / opa-java-wasm
View on GitHub
OPA Wasm rules using Chicory as the runtime
☆32Jul 16, 2026Updated last week
ExpediaGroup / plunger
View on GitHub
A unit testing framework for the Cascading data processing platform.
☆25Aug 25, 2021Updated 4 years ago
ExpediaGroup / stream-registry
View on GitHub
Stream Discovery and Stream Orchestration
☆124Jan 7, 2026Updated 6 months ago
hermannpencole / nifi-swagger-client
View on GitHub
Client swagger for nifi with security
☆39May 20, 2022Updated 4 years ago