Primary repository for NYC DCP's Data Engineering team
☆38 · Apr 4, 2026 · Updated last week
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below.
- ☆14 · Dec 11, 2023 · Updated 2 years ago
- A custom end-to-end analytics platform for customer churn ☆11 · May 15, 2025 · Updated 10 months ago
- Data pipelines for datasets that are part of the Recovery Data Partnership project ☆12 · Oct 26, 2022 · Updated 3 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated last year
- Code to demonstrate data engineering metadata & logging best practices ☆21 · Mar 12, 2024 · Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆40 · Apr 29, 2024 · Updated last year
- Repo for CDC with debezium blog post ☆29 · Sep 15, 2024 · Updated last year
- This is a GAS application for rearranging Google Apps Scripts (GAS) in a project which can be seen at the script editor. ☆16 · Apr 14, 2018 · Updated 7 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆25 · Jan 26, 2024 · Updated 2 years ago
- Presentations from the 2024 Fellowship. ☆15 · Sep 17, 2024 · Updated last year
- Example repo to create end to end tests for data pipeline. ☆25 · Jun 14, 2024 · Updated last year
- Cost Efficient Data Pipelines with DuckDB ☆63 · May 14, 2025 · Updated 10 months ago
- Code for data quality with greatexpectations blog ☆13 · Jul 30, 2024 · Updated last year
- Step by step instructions to create a production-ready data pipeline ☆58 · Dec 23, 2024 · Updated last year
- A full text and metadata extractor for CKAN ☆19 · Nov 21, 2018 · Updated 7 years ago
- ao3 beautiful txt getter archieveofourown download ☆14 · Oct 8, 2023 · Updated 2 years ago
- CalData's MDSA project with Caltrans on Performance Measurement System (PeMS) data ☆12 · Aug 19, 2025 · Updated 7 months ago
- Place for sharing quick reports, and works in progress ☆33 · Updated this week
- Tools for working with types where a subset of values has a total order, like e.g. floats without NaN ☆13 · Nov 7, 2025 · Updated 5 months ago
- A machine-readable format for storing and sharing water rate structures. ☆24 · Oct 22, 2020 · Updated 5 years ago
- Common GitHub actions and workflows for maintaining dbt ☆15 · Updated this week
- A tool for creating pivot tables from the command line. ☆14 · Mar 16, 2023 · Updated 3 years ago
- Rust tools for working with CSV files: scrubcsv, catcsv, fixed2csv, geochunk, hashcsv. ☆19 · Jan 17, 2026 · Updated 2 months ago
- Manage multiple dat instances on a single machine ☆14 · Mar 29, 2016 · Updated 10 years ago
- A Python API client used to pull and retrieve data from the US Bureau of Economic Analysis ☆23 · Dec 23, 2021 · Updated 4 years ago
- Source code and logic to build Luau for Rust ☆11 · Mar 28, 2026 · Updated 2 weeks ago
- Source code and logic to build LuaJIT 2.1 for Rust ☆19 · Mar 28, 2026 · Updated 2 weeks ago
- Project for "Data pipeline design patterns" blog. ☆51 · Aug 6, 2024 · Updated last year
- Code for "Advanced data transformations in SQL" free live workshop ☆92 · May 5, 2025 · Updated 11 months ago
- Instructions for using and issue tracking for the hosted GitHub Actions runner for IBM Power and IBM Z and LinuxONE ☆24 · Feb 23, 2026 · Updated last month
- Prints numbers with separators ☆11 · Apr 29, 2021 · Updated 4 years ago
- Figures out the local timezone as IANA / Olson identifier ☆15 · Apr 8, 2024 · Updated 2 years ago
- CKAN Issues Extension ☆14 · Sep 16, 2021 · Updated 4 years ago
- Visualizing data to better monitor issues around food security ☆14 · Nov 28, 2024 · Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆103 · Jun 7, 2024 · Updated last year
- Lib flatterer: A lib to make JSON flatterer ☆17 · May 16, 2025 · Updated 10 months ago
- Data on 268 New York City traffic deaths in 2014. ☆10 · Feb 19, 2015 · Updated 11 years ago
- ☆25 · Mar 28, 2026 · Updated 2 weeks ago
- Robust Rust library for converting JSON objects into CSV rows ☆11 · Sep 18, 2023 · Updated 2 years ago