feluelle / finance-data-builder
Finance π¦ Data Builder π οΈ @ postgres π
β20Updated 4 years ago
Alternatives and similar repositories for finance-data-builder:
Users that are interested in finance-data-builder are comparing it to the libraries listed below
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatioβ¦β53Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,β¦β90Updated 3 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.β29Updated last year
- Data lake, data warehouse on GCPβ55Updated 3 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority.β29Updated last year
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as β¦β16Updated 5 years ago
- Amazon Redshift Cookbook, Published by Packtβ15Updated 2 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize aβ¦β19Updated 9 months ago
- Full stack data engineering tools and infrastructure set-upβ48Updated 4 years ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMakerβ31Updated 3 years ago
- Snowflake Cookbook, published by Packtβ76Updated 2 years ago
- Cloned by the `dbt init` taskβ60Updated 9 months ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startupsβ16Updated 6 years ago
- Explore tips and tricks to deploy machine learning models with Docker.β13Updated last year
- Serverless ETL and Analytics with AWS Glue, published by Packtβ46Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online courseβ18Updated 4 years ago
- dbt / Amazon Redshift Demonstration Projectβ33Updated 2 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract dβ¦β24Updated 3 years ago
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' andΒ MLFlow'β119Updated last year
- Udacity Data Pipeline Exercisesβ15Updated 4 years ago
- β18Updated 3 years ago
- β17Updated 6 months ago
- β84Updated last year
- Airflow training for the crunch confβ105Updated 6 years ago
- β87Updated 2 years ago
- A repo to track data engineering projectsβ13Updated 2 years ago
- End-to-end data platform leveraging the Modern data stackβ46Updated 10 months ago
- Code for my "Efficient Data Processing in SQL" book.β55Updated 6 months ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in β¦β21Updated 2 years ago