Xadra-T / End2End-Data-Pipeline
An end-to-end, containerized data pipeline for near-real-time user event analytics using Kafka, ClickHouse, Airflow, and PySpark. Made to learn some common data engineering practices.
☆56 · Sep 12, 2025 · Updated 5 months ago
Alternatives and similar repositories for End2End-Data-Pipeline
Users who are interested in End2End-Data-Pipeline are comparing it to the libraries listed below.
- Project utilising data from the Age of Empires api at 'https://aoestats.io' ☆55 · Dec 8, 2024 · Updated last year
- A Python CLI application that demonstrates how you can access AWS services, such as Amazon S3 and Amazon Athena, using trusted identity p… ☆12 · Mar 11, 2025 · Updated 11 months ago
- ☆11 · Nov 26, 2024 · Updated last year
- Real-time OLTP system for credit card fraud detection using AWS API Gateway, Kinesis, and RDS PostgreSQL. Features a scalable, serverless… ☆22 · Dec 16, 2024 · Updated last year
- Terragrunt friendly module to create AWS API Gateway (V1) w\Optional WAF, many stages/api keys/usage plans using the OpenAPI 3.x spec. 🇺… ☆11 · Feb 6, 2026 · Updated last week
- Serverless Multi-Tenant Application on AWS Amplify ☆17 · Jan 11, 2024 · Updated 2 years ago
- Spark application to consume kafka events generated by a python producer. ☆12 · Aug 7, 2021 · Updated 4 years ago
- Business challenge that requires building a data platform for retailer data analytics. ☆16 · Feb 19, 2023 · Updated 2 years ago
- Data Engineering Hours With Experts Coding Challenge ☆12 · Sep 14, 2023 · Updated 2 years ago
- ☆13 · Nov 22, 2024 · Updated last year
- It shows a simple chatbot based on bedrock LLM where Question/Answering and Summarization of a document are provided based on LangChain. ☆11 · Nov 27, 2023 · Updated 2 years ago
- ☆16 · Oct 18, 2023 · Updated 2 years ago
- Useful notes and tips about Python Programming Language ☆13 · Dec 1, 2024 · Updated last year
- Project based learning for Data Engineering fundamentals. ☆13 · Jan 15, 2021 · Updated 5 years ago
- Build your portfolio in minutes ☆12 · Jan 28, 2021 · Updated 5 years ago
- Sample application showcasing the use of Dapr to build microservices based apps ☆15 · Feb 4, 2026 · Updated last week
- ☆14 · Nov 26, 2020 · Updated 5 years ago
- Fully configurable terraform module to access AWS APIs from Github Actions through OpenID Connect. ☆19 · Jun 11, 2025 · Updated 8 months ago
- ☆18 · Nov 18, 2025 · Updated 2 months ago
- Examples for consuming Private API Gateway across AWS Accounts ☆22 · Updated this week
- ☆20 · Oct 18, 2023 · Updated 2 years ago
- Facemask detection using MobileNet ☆15 · Jun 18, 2022 · Updated 3 years ago
- A curated list of awesome resources related to Commercetools ☆21 · May 9, 2024 · Updated last year
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆26 · Jul 2, 2025 · Updated 7 months ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS). ☆15 · Jun 3, 2021 · Updated 4 years ago
- ☆19 · Jun 28, 2024 · Updated last year
- ☆15 · Nov 16, 2024 · Updated last year
- Backend API developed using FastAPI for a simplified version of a digital book collection. ☆17 · Sep 17, 2024 · Updated last year
- Spot roadmap ☆25 · Mar 26, 2025 · Updated 10 months ago
- Execute DBT core on cloud run ☆21 · May 6, 2024 · Updated last year
- Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes s… ☆33 · Nov 4, 2025 · Updated 3 months ago
- Terraform code to deploy a SageMaker domain in VPC-only mode that supports multiple Studio and Canvas features ☆20 · Aug 16, 2023 · Updated 2 years ago
- ☆18 · Jun 16, 2024 · Updated last year
- Template to spin up delta lake locally using docker ☆23 · Oct 2, 2023 · Updated 2 years ago
- ☆26 · May 25, 2022 · Updated 3 years ago
- ☆21 · Nov 4, 2023 · Updated 2 years ago
- ☆40 · Updated this week
- Databricks. Incremental data processing, task orchestration, and production job monitoring. ☆37 · Feb 27, 2024 · Updated last year
- Challenge Data Engineer ☆25 · Jun 13, 2022 · Updated 3 years ago