dataquestio / analytics_pipeline
Code to build a simple analytics data pipeline with Python
☆102Updated 8 years ago
Alternatives and similar repositories for analytics_pipeline:
Users that are interested in analytics_pipeline are comparing it to the libraries listed below
- Course materials for my data pipeline video course with O'Reilly☆194Updated 7 years ago
- ETL with Python - Taught at DWH course 2017 (TAU)☆102Updated 7 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- ☆199Updated 3 years ago
- ☆16Updated 7 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆122Updated 2 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated 2 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Repository used for Spark Trainings☆53Updated last year
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ☆46Updated 3 years ago
- Code, slides, and documentation for the talks I have given.☆113Updated last year
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- ☆201Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Analyzing NBA data using Spark 2.1☆46Updated 8 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 6 years ago
- Python data science and machine learning from Ted Petrou with Dunder Data☆54Updated 2 years ago
- Airflow ETL for Meetup API☆46Updated 6 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆60Updated 6 years ago
- Updated repository☆157Updated 3 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Data Analytics, Statistics, Visualization (R / Python)☆92Updated 7 years ago
- Sharing interesting and noteworthy Data Engineering content☆67Updated 8 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆86Updated 5 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s…☆27Updated 7 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago