An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)
☆16Sep 20, 2023Updated 2 years ago
Alternatives and similar repositories for aws_end_to_end_streaming_pipeline
Users that are interested in aws_end_to_end_streaming_pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆43Sep 26, 2023Updated 2 years ago
- Simple project using pyflink, kafka and postgre containerized using Docker☆11Aug 26, 2024Updated last year
- Get Crypto data from API, stream it to Kafka with Airflow. Write data to MySQL and visualize with Metabase☆17Oct 2, 2023Updated 2 years ago
- Provision Ubuntu VMs with Vagrant and VMware on macOS ARM64☆11Oct 26, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆12Feb 6, 2023Updated 3 years ago
- In this project, we will be deploying a Kubernetes cluster using a Jenkins CI/CD pipeline. We will be utilizing various DevOps tools such…☆14Jun 6, 2023Updated 3 years ago
- ☆22Oct 21, 2024Updated last year
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Club☆22Aug 2, 2022Updated 3 years ago
- Data Engineering Youtube Project☆12Jun 29, 2023Updated 2 years ago
- ☆11Jul 30, 2017Updated 8 years ago
- Learn how multimodal AI merges text, image, and audio for smarter models☆30Jan 21, 2025Updated last year
- This repository contains examples for my article published on Medium☆11Oct 29, 2017Updated 8 years ago
- Docker Images with Databricks Connect Ready to go☆24Dec 26, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A finance dashboard built with Next.js + Chakra UI ⚡. Powered by the Plaid Open Banking API 🧩☆13Sep 16, 2021Updated 4 years ago
- code snippet for analytics sessions☆34May 17, 2022Updated 4 years ago
- A Deep Reinforcement Learning Library for Automated Trading in Quantitative Finance. NeurIPS 2020. Please star. 🔥☆12Mar 13, 2023Updated 3 years ago
- This repository provides a Linux kernel driver for AXI UART Lite accessed via PCIe XDMA. It enables efficient DMA-based UART communicatio…☆16May 2, 2025Updated last year
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- ☆19Jul 19, 2025Updated 11 months ago
- ☆12Sep 22, 2021Updated 4 years ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Flink 中文社区文章整理☆13Jun 3, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A distributed Kafka Consumer in Python using Ray☆24Feb 7, 2023Updated 3 years ago
- ☆15Jul 1, 2023Updated 2 years ago
- Importing AdventureWorks (SQL Server Sample Database) to Neo4j☆15Jun 17, 2025Updated last year
- 通过观看尚硅谷的Flink实战视频,开了一个仓库,记录源码和一些所需要的数据文件,也欢迎大家积极讨论☆17Mar 1, 2021Updated 5 years ago
- ☆40Jul 16, 2024Updated last year
- ☆13May 21, 2021Updated 5 years ago
- Plant disease detection app with over 4000 downloads that utilizes ResNet-50 CNN architecture for image classification.☆15Dec 26, 2021Updated 4 years ago
- An ecommerce site for 3D-printed models, built with Python (Django)☆12Nov 17, 2023Updated 2 years ago
- the benchmark for finance☆11Jul 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- Unsupervised ML: Finding Customer Segments in General Population☆16Oct 25, 2019Updated 6 years ago
- Generic Database Access Layer implementation in Golang.☆12Feb 8, 2026Updated 4 months ago
- This IoT-based smart irrigation system, utilizing ESP32 and Blynk, automates irrigation based on soil conditions. Remote monitoring enhan…☆24Jun 13, 2025Updated last year
- Design data models, build data warehouses, data lakes & lakehouse, automate data pipelines - SQL | NoSQL | AWS | Spark | Airflow☆16Aug 19, 2023Updated 2 years ago
- 计算广告召回&模型&创意算法(A collection of research and application papers about Match, Ranking, Targeting and Creatives in Internet advertising.)☆18Jan 26, 2024Updated 2 years ago
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component…☆47Sep 26, 2024Updated last year