An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information
☆30Apr 22, 2022Updated 4 years ago
Alternatives and similar repositories for etl-pipeline-example
Users that are interested in etl-pipeline-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repository for the LinkedIn Learning course Advanced AWS CloudFormation for Enterprise☆11Nov 5, 2023Updated 2 years ago
- ☆13Jun 15, 2023Updated 2 years ago
- ☆95Sep 14, 2022Updated 3 years ago
- Simple, performant data pipelines.☆10Jan 6, 2022Updated 4 years ago
- ☆19Nov 27, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- SnowShu is a sampling engine designed to support testing in data development.☆12Aug 26, 2025Updated 8 months ago
- This is a demo repository for parallel multi-index question answering using streamlit and llama index☆24Aug 31, 2023Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Sep 1, 2018Updated 7 years ago
- 🎲 Repositório para armazenar todos os componentes referentes a Data Science / Data Engineering do projeto☆17Oct 27, 2023Updated 2 years ago
- ☆13Apr 8, 2020Updated 6 years ago
- how to be an AI Engineer☆15Jul 27, 2023Updated 2 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Mar 26, 2025Updated last year
- Datawaves data models for Ethereum built using dbt☆21Sep 3, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python package for Plotly/Dash apps with support for multi-page, modules, and new charts such as Pareto with an Object Orient Approach☆20Aug 5, 2022Updated 3 years ago
- Relational Database Import to Big Query with Dataflow and DLP API☆18Dec 16, 2019Updated 6 years ago
- LSPosed module to send Monsieur Cuisine data to MQTT for Homeassistant.☆25Jul 10, 2025Updated 9 months ago
- A SparkSQL formatter based on https://github.com/zeroturnaround/sql-formatter, with customizations and extra features.☆14Nov 7, 2024Updated last year
- Repository for code examples from my youtube channel and medium articles working with data in python on AWS☆29Feb 5, 2024Updated 2 years ago
- Run Azure Data Factory self-hosted integration runtime on Azure App Service☆11Apr 25, 2023Updated 3 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 2 years ago
- EDB Reference Architectures☆16Jan 18, 2023Updated 3 years ago
- Web tool for viewing real-time map data from MyGeotab☆16Mar 4, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Jupyter notebooks for cloud-based usage☆10Aug 26, 2023Updated 2 years ago
- A digit autoencoder for humans 🧬☆33Jun 4, 2021Updated 4 years ago
- ☆21Mar 31, 2024Updated 2 years ago
- Super simple KeyValue store for python, backed by sqlite.☆13Apr 18, 2024Updated 2 years ago
- A repository dedicated to storing guided projects completed while learning data science concepts with Dataquest.☆13Jan 11, 2025Updated last year
- Youtube Too Long Didn't Watch☆13Sep 2, 2024Updated last year
- A small tool for embedding files in a Go source file.☆11Nov 3, 2020Updated 5 years ago
- AWS Certified Solutions Architect Professional SAP-C01 New Feb 2019 Version Exam Notes☆17Apr 6, 2019Updated 7 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆40Jul 17, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14May 22, 2023Updated 2 years ago
- Tool to help migrate application code from Oracle to PostgreSQL☆24Jan 2, 2023Updated 3 years ago
- A simple IP lookup tool written in Python with concurrency support.☆16Jul 12, 2025Updated 9 months ago
- Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"☆17Jul 27, 2024Updated last year
- The Regularization Cookbook, published by Packt☆16Mar 2, 2026Updated last month
- Add shortcuts for Logseq task management☆13Jul 4, 2023Updated 2 years ago
- ☆64Jan 9, 2024Updated 2 years ago