Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.
☆245 · Sep 12, 2022 · Updated 3 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below.
- Learn how to create reliable ML systems by testing code, data and models. ☆93 · Sep 12, 2022 · Updated 3 years ago
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently. ☆61 · Sep 12, 2022 · Updated 3 years ago
- Learn how to monitor ML systems to identify and mitigate sources of drift before model performance decays. ☆104 · Sep 12, 2022 · Updated 3 years ago
- Learn how to design, develop, deploy and iterate on production-grade ML applications. ☆3,379 · Aug 16, 2024 · Updated last year
- Data Engineer Roadmaps as Projects Funnel ☆12 · Aug 10, 2022 · Updated 3 years ago
- This is a repository of scripts developed as part of the 2020 ENCMP100 Section B3 lecture taught at University of Alberta. ☆10 · Apr 2, 2020 · Updated 6 years ago
- learning-by-doing data model built with dbt-core ☆17 · Apr 10, 2026 · Updated last month
- Data Engineering Bootcamp 2021 ☆13 · Aug 8, 2023 · Updated 2 years ago
- An MLflow Provider Package for Apache Airflow ☆26 · Oct 22, 2025 · Updated 7 months ago
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e… ☆17 · May 23, 2024 · Updated 2 years ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide ☆801 · Mar 10, 2026 · Updated 2 months ago
- ☆31 · Jul 29, 2023 · Updated 2 years ago
- Easily import a module and mock its dependencies in an isolated way. ☆13 · May 19, 2022 · Updated 4 years ago
- Intelligent Document Processing with AWS AI/ML, published by Packt ☆12 · Apr 22, 2026 · Updated last month
- Training HuggingFace models using fastai ☆11 · Jul 22, 2021 · Updated 4 years ago
- A Rust crate offering similar functionality to the Python transformers package using Candle. ☆14 · Nov 19, 2024 · Updated last year
- ☆24 · May 11, 2025 · Updated last year
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP. ☆15 · May 3, 2021 · Updated 5 years ago
- ☆14 · Mar 9, 2023 · Updated 3 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated 2 years ago
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ… ☆20 · Mar 25, 2026 · Updated 2 months ago
- Learn how to develop, deploy and iterate on production-grade ML applications. ☆47,763 · Mar 4, 2026 · Updated 2 months ago
- Repository for Spark using Python material. It is popularly known as PySpark. ☆20 · Aug 18, 2021 · Updated 4 years ago
- ☆17 · Apr 1, 2024 · Updated 2 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath… ☆22 · Feb 3, 2026 · Updated 3 months ago
- Optimizing Hyperparameters with Conformal Quantile Regression ☆11 · May 22, 2023 · Updated 3 years ago
- An open courseware project in Deep Learning using Keras + Tensorflow. // An open project, in Portuguese, dedicated to teaching Deep Learnin… ☆13 · Apr 18, 2019 · Updated 7 years ago
- Content for a talk on "The wonderful world of data quality tools in Python" ☆18 · May 5, 2021 · Updated 5 years ago
- Using python3.6 alpine base image adds java, pandas, numpy, pyspark and spark as rundeps. This image can be used as container image when yo… ☆12 · Nov 11, 2022 · Updated 3 years ago
- An Awesome List of Open-Source Data Engineering Projects ☆3,189 · Oct 4, 2024 · Updated last year
- Collection of Python utility scripts & OOP basic demo | #SE ☆14 · Jan 8, 2025 · Updated last year
- ☆25 · Apr 23, 2022 · Updated 4 years ago
- Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes s… ☆43 · Apr 18, 2026 · Updated last month
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo… ☆40,979 · May 3, 2026 · Updated 3 weeks ago
- Visual Inspection AI Edge solution infrastructure provisioning scripts ☆17 · Nov 12, 2024 · Updated last year
- Duke MIDS: Data Engineering and DataOps Course ☆70 · Jan 10, 2025 · Updated last year
- Pretty notification box ☆16 · May 31, 2022 · Updated 3 years ago
- The Data Engineering Book - a data engineering book by Thais, for Thais ☆115 · Mar 1, 2026 · Updated 2 months ago
- Snippets of the basic course from Batch Scripting tutorial ☆13 · Aug 15, 2021 · Updated 4 years ago