feluelle / finance-data-builderLinks
Finance π¦ Data Builder π οΈ @ postgres π
β21Updated 4 years ago
Alternatives and similar repositories for finance-data-builder
Users that are interested in finance-data-builder are comparing it to the libraries listed below
Sorting:
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatioβ¦β55Updated 2 years ago
- Full stack data engineering tools and infrastructure set-upβ53Updated 4 years ago
- Data lake, data warehouse on GCPβ56Updated 3 years ago
- Code for dbt tutorialβ157Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,β¦β90Updated 3 years ago
- Snowflake Cookbook, published by Packtβ79Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflowβ47Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startupsβ16Updated 6 years ago
- data engineering 100 days π€ π§² π¦Ύ | #DEβ39Updated last year
- Execution of DBT models using Apache Airflow through Docker Composeβ117Updated 2 years ago
- Code test for data engineering candidatesβ47Updated last year
- Course Material Data Engineering on AWS Courseβ29Updated 8 months ago
- Code for my "Efficient Data Processing in SQL" book.β56Updated 10 months ago
- β18Updated 9 months ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as stagingβ¦β88Updated 5 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projectsβ83Updated last year
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.β29Updated last year
- Example repo to create end to end tests for data pipeline.β24Updated 11 months ago
- Design/Implement stream/batch architecture on NYC taxi data | #DEβ25Updated 4 years ago
- Cloned by the `dbt init` taskβ61Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setupβ88Updated 4 years ago
- A repo to track data engineering projectsβ13Updated 2 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for β¦β136Updated 5 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3β30Updated 4 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.β16Updated 4 years ago
- β40Updated 11 months ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflowβ146Updated 4 years ago
- β17Updated 10 months ago
- Dockerizing an Apache Spark Standalone Clusterβ43Updated 2 years ago
- β31Updated 6 years ago