These are project documentation templates derived from CRISP-DM, intended for Data Engineering projects.
☆61Aug 16, 2021Updated 4 years ago
Alternatives and similar repositories for data-engineering-project-doc-templates
Users that are interested in data-engineering-project-doc-templates are comparing it to the libraries listed below.
- Knowledge sharing - Cheat sheets☆21Updated this week
- How to use Indonesian NER with spaCy☆17Mar 27, 2022Updated 4 years ago
- Extract, transform, and load data for analytic processing using AWS Glue☆17May 2, 2021Updated 4 years ago
- Python version of dbtools☆12Jul 30, 2025Updated 7 months ago
- A Python wrapper for the Iterable API☆12Jan 7, 2026Updated 2 months ago
- ms-dataverse is a Python module for Microsoft Dataverse, offering a lightweight ORM to query, create, update, and delete entities. Utiliz…☆13Apr 10, 2023Updated 2 years ago
- Listen to a Redis PubSub channel and then rebroadcast it over Server-Sent Events (SSE).☆12Mar 19, 2026Updated last week
- ☆12Aug 8, 2023Updated 2 years ago
- Fully unit tested utility functions for data engineering. Python 3 only.☆18Mar 12, 2026Updated 2 weeks ago
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆49Mar 13, 2026Updated 2 weeks ago
- ☆11Jan 9, 2022Updated 4 years ago
- An introduction to importing, visualizing, and analyzing climate data in MATLAB.☆21Oct 10, 2025Updated 5 months ago
- A MATLAB project that uses a signal processing technique, digital watermarking, to hide data in an audio track☆19Jun 16, 2019Updated 6 years ago
- ☆11Apr 13, 2020Updated 5 years ago
- A parser combinator library in Go☆14Feb 17, 2020Updated 6 years ago
- Codes, datasets, and explanations for some basic natural language tasks and models.☆11Dec 9, 2020Updated 5 years ago
- Scraper and analyzer of shared scooter data☆11Jul 30, 2024Updated last year
- A map transformer which implements the `Stream Maps` capability from Meltano's tap and target SDK: https://sdk.meltano.com/ ☆19Updated this week
- Example FastAPI app deployed to AWS with CDK.☆16Feb 23, 2023Updated 3 years ago
- Testing Boring SL with DuckDB☆32Aug 18, 2025Updated 7 months ago
- SQL static analyzer for performance, security, compliance and cost. 272 rules. Completely offline. Works in CI pipelines.☆94Updated this week
- chDB AWS Lambda container☆18Aug 31, 2023Updated 2 years ago
- Repository for Data Engineering Zoomcamp 2024☆14Mar 25, 2024Updated 2 years ago
- Download movies and series for free, easily and quickly.☆11Mar 17, 2021Updated 5 years ago
- Process manager and website for hosting multiple Streamlit apps☆14Jun 28, 2023Updated 2 years ago
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering☆16Jul 12, 2025Updated 8 months ago
- Curated list of yield farms and tools 🤑☆12Jul 15, 2021Updated 4 years ago
- Data encoding library for Haskell.☆12Aug 4, 2023Updated 2 years ago
- Sample projects and examples on using NetApp technologies with Kubernetes☆13Mar 21, 2018Updated 8 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Sep 29, 2020Updated 5 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆24Aug 17, 2023Updated 2 years ago
- A complete No-Code Machine Learning platform built with Streamlit. Upload datasets, visualize data, and train models (Regression, SVM, K-…☆16Dec 3, 2025Updated 3 months ago
- Image building contents for running Spark standalone on Kubernetes☆16Apr 10, 2020Updated 5 years ago
- Wrapper for Spotify API that generates user-specific playlists☆14Feb 15, 2023Updated 3 years ago
- 🦆 Small docker image with DuckDB☆21May 23, 2024Updated last year
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 2 months ago
- Swap between Yearn Vaults V2☆10Feb 12, 2021Updated 5 years ago
- ☆11May 13, 2021Updated 4 years ago
- ☆14Nov 26, 2020Updated 5 years ago