The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
☆32Jun 6, 2025Updated 10 months ago
Alternatives and similar repositories for data-integration-library
Users that are interested in data-integration-library are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Playground site for creating/validating data contracts☆11Aug 9, 2025Updated 8 months ago
- Welcome to this repo where we'll continue adding hands-on examples demonstrating the use of the Java SDK for Amazon Bedrock.☆15Jul 12, 2024Updated last year
- Incan: a modern, Pythonic language that compiles to Rust! Type-safe, async-friendly, with fixtures, testing, and web/inter-op built in.☆16Updated this week
- Python wrapper for lsm1 extension for sqlite4☆15Feb 27, 2025Updated last year
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆11Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆123Updated this week
- Create CDK Apps from Templates☆20Jun 22, 2022Updated 3 years ago
- This repository contains code written in the AWS Cloud Development Kit (CDK) which launches infrastructure across two different regions t…☆12Mar 10, 2022Updated 4 years ago
- The public facing site of the OCF☆18Feb 15, 2024Updated 2 years ago
- This solution provides the AWS CDK and AWS CloudFormation infrastructure to build an enterprise data mesh with Amazon DataZone.☆10May 7, 2025Updated 11 months ago
- Projen project base types used at ClickUp☆13Updated this week
- Geospatial python toolkit: common functions, easy CLI creation, dataframes streams☆18May 16, 2024Updated last year
- The open source version of the AWS Data Pipeline documentation. To provide feedback & requests for changes, submit issues in this reposit…☆15Jun 15, 2023Updated 2 years ago
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon.☆44Nov 25, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Sample solution to build a deployment pipeline for Amazon SageMaker.☆13Jul 18, 2022Updated 3 years ago
- This solution provides a way to deploy SageMaker Studio in a private and secure environment. The solution integrates with a Custom SAML 2…☆14Apr 11, 2023Updated 3 years ago
- Offical Flyway Community Supported Database Plugins☆16Aug 18, 2025Updated 7 months ago
- JSConf forever.☆51Jun 12, 2018Updated 7 years ago
- MLOps Pipeline Using SageMaker & CDK, where models are from SageMaker built-in algorithms.☆27Apr 1, 2025Updated last year
- ☆14Dec 16, 2019Updated 6 years ago
- EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Goo…☆44Aug 26, 2024Updated last year
- This package contains the grammar in ANTLR g4 format and Java parser for the Data Quality Definition Language (DQDL), used by AWS Glue Da…☆22Mar 26, 2026Updated 2 weeks ago
- ☆13Feb 26, 2024Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆15Feb 12, 2026Updated last month
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- Fine grain access in Amazon Managed Workflows For Apache Airflow☆11Jul 30, 2021Updated 4 years ago
- ☆32Feb 29, 2024Updated 2 years ago
- ☆29Jan 18, 2023Updated 3 years ago
- This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images from…☆10Jul 12, 2022Updated 3 years ago
- An implementation of Dijkstra in Clojure☆19Aug 7, 2012Updated 13 years ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 9 months ago
- Airflow support for Marquez☆30Dec 11, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- library for processing s3select queries and execute them on CSV files (current phase)☆18Jan 5, 2026Updated 3 months ago
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆76Feb 14, 2026Updated last month
- Husky for python☆13Jun 5, 2017Updated 8 years ago
- A repository to provide an example of deploying jaeger into an AWS ECS cluster☆13May 9, 2019Updated 6 years ago
- This Guidance helps customers design a resilient batch process application using AWS services☆19Mar 1, 2026Updated last month
- Pachyderm/MLeap team up to provide versioned datasets + models☆10Jun 7, 2017Updated 8 years ago
- React web app for Construct Hub☆21Mar 30, 2026Updated last week