linkedin/data-integration-library

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linkedin/data-integration-library)

linkedin / data-integration-library

The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.

☆33

Alternatives and similar repositories for data-integration-library

Users that are interested in data-integration-library are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

opendatamesh-initiative / odm-platform
View on GitHub
A platform to manage the data product life cycle
☆22Mar 25, 2026Updated 4 months ago
BenFradet / spark-kaggle
View on GitHub
Different entries to kaggle contests using Apache Spark
☆13Jun 5, 2017Updated 9 years ago
mosquito / python-lsm
View on GitHub
Python wrapper for lsm1 extension for sqlite4
☆15Feb 27, 2025Updated last year
smart-data-lake / smart-data-lake
View on GitHub
Smart Automation Tool for building modern Data Lakes and Data Pipelines
☆129Updated this week
cdk-dev / create-cdk-app
View on GitHub
Create CDK Apps from Templates
☆20Jun 22, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
aws-samples / aws-appsync-multi-region-deployment
View on GitHub
This repository contains code written in the AWS Cloud Development Kit (CDK) which launches infrastructure across two different regions t…
☆12Mar 10, 2022Updated 4 years ago
projen / awesome
View on GitHub
Curated list of awesome projen projects
☆13Jan 1, 2025Updated last year
aws-mwaa / upstream-to-airflow
View on GitHub
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆11Updated this week
aws-samples / data-mesh-datazone-cdk-cloudformation
View on GitHub
This solution provides the AWS CDK and AWS CloudFormation infrastructure to build an enterprise data mesh with Amazon DataZone.
☆10Updated this week
culebron / erde
View on GitHub
Geospatial python toolkit: common functions, easy CLI creation, dataframes streams
☆19May 16, 2024Updated 2 years ago
aolney / fable-jupyterlab-blockly-extension
View on GitHub
A JupyterLab extension implementing a Blockly palette with Fable tooling.
☆12Mar 4, 2023Updated 3 years ago
potassco / tree-sitter-clingo
View on GitHub
🌳 Clingo grammar for tree-sitter
☆15Jul 1, 2026Updated 3 weeks ago
tharwaninitin / etlflow
View on GitHub
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Goo…
☆45Aug 26, 2024Updated last year
alviano / asp-chef
View on GitHub
☆13Jun 25, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
aws-samples / amazon-sagemaker-mlops-byoc-using-codepipeline-aws-cdk
View on GitHub
Sample solution to build a deployment pipeline for Amazon SageMaker.
☆14Jul 18, 2022Updated 4 years ago
leegilmorecode / Serverless-AWS-CDK-Best-Practices-Patterns-Part2
View on GitHub
An opinionated discussion around how to set up, structure, and deploy your AWS CDK Serverless apps using CDK Pipelines in line with AWS b…
☆13Mar 4, 2023Updated 3 years ago
aws-samples / aws-autonomous-driving-data-lake-image-extraction-pipeline-from-ros-bagfiles
View on GitHub
This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images from…
☆11Jul 12, 2022Updated 4 years ago
aws-samples / amazon-sagemaker-studio-secure-sso
View on GitHub
This solution provides a way to deploy SageMaker Studio in a private and secure environment. The solution integrates with a Custom SAML 2…
☆14Apr 11, 2023Updated 3 years ago
SSripilaipong / lyrid
View on GitHub
☆29Jan 18, 2023Updated 3 years ago
SemyonSinchenko / flake8-pyspark-with-column
View on GitHub
A flake8 plugin that detects of usage withColumn in a loop or inside reduce
☆28Jun 20, 2025Updated last year
aws-samples / amazon-aurora-database-migration-workshop-reinvent2019
View on GitHub
☆14Dec 16, 2019Updated 6 years ago
aws-samples / aws-lakeformation-datasharing-workflow
View on GitHub
☆15Feb 12, 2026Updated 5 months ago
aws-samples / mwaa-rbac-task
View on GitHub
Fine grain access in Amazon Managed Workflows For Apache Airflow
☆11Jul 30, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
airlift / command
View on GitHub
Convenience library for executing external processes
☆21Sep 27, 2017Updated 8 years ago
HL7 / fhir-omop-ig
View on GitHub
A FHIR implementation guide that supports conversion of data from FHIR to OMOP and OMOP to FHIR
☆16Updated this week
marchinho11 / hnhm
View on GitHub
Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model
☆23Dec 24, 2024Updated last year
cmudig / solas
View on GitHub
Visualization Recommendation Based on Analysis History
☆15Aug 31, 2023Updated 2 years ago
build-on-aws / amazon-bedrock-java-examples
View on GitHub
Welcome to this repo where we'll continue adding hands-on examples demonstrating the use of the Java SDK for Amazon Bedrock.
☆16Jul 12, 2024Updated 2 years ago
jenkinsci / octopusdeploy-plugin
View on GitHub
Jenkins plugin which integrates with Octopus Deploy
☆10Dec 12, 2021Updated 4 years ago
davis68 / letterhead
View on GitHub
University of Illinois letterhead template
☆14Jun 4, 2019Updated 7 years ago
bramucas / xclingo2
View on GitHub
A tool for explainability and debugging in Answer Set Programming.
☆15May 15, 2026Updated 2 months ago
signal-ai / jaeger-aws
View on GitHub
A repository to provide an example of deploying jaeger into an AWS ECS cluster
☆13May 9, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
awslabs / dqdl
View on GitHub
This package contains the grammar in ANTLR g4 format and Java parser for the Data Quality Definition Language (DQDL), used by AWS Glue Da…
☆23Updated this week
ceph / s3select
View on GitHub
library for processing s3select queries and execute them on CSV files (current phase)
☆18Jan 5, 2026Updated 6 months ago
SemyonSinchenko / graphframes-rs
View on GitHub
GraphFrames but in DataFusion
☆27Updated this week
google-coral / demo-manufacturing
View on GitHub
☆14Jul 12, 2021Updated 5 years ago
VIDA-NYU / pycalibrate
View on GitHub
pycalibrate is a Python library to visually analyze model calibration in Jupyter Notebooks
☆17Jul 2, 2022Updated 4 years ago
the-ocf / public-site
View on GitHub
The public facing site of the OCF
☆18Feb 15, 2024Updated 2 years ago
minkebox / minkenet
View on GitHub
Unified Software Networking for cheap hardware
☆13Jun 20, 2021Updated 5 years ago