A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets (Lambda Architecture)
☆190 Mar 17, 2026 Updated 2 months ago
Alternatives and similar repositories for big-data-pipeline-lambda-arch
Users who are interested in big-data-pipeline-lambda-arch compare it to the repositories listed below.
- ☆13 May 3, 2022 Updated 4 years ago
- ☆10 May 1, 2023 Updated 3 years ago
- Simple demo implementation of Lambda and Kappa architectures using Python, Docker, Kafka, Spark and Cassandra ☆40 Mar 15, 2018 Updated 8 years ago
- Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python ☆22 Mar 8, 2020 Updated 6 years ago
- Goal: create a Spring Boot application that handles users using Event Sourcing. So, whenever a user is created, updated, or deleted, an e… ☆28 Updated this week
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary. ☆29 Aug 14, 2023 Updated 2 years ago
- An open-source backtesting and live trading platform for foreign exchange ☆77 Jan 3, 2025 Updated last year
- ☆13 Jul 16, 2018 Updated 7 years ago
- ☆11 Mar 30, 2025 Updated last year
- A Google Chrome extension to download Udacity.com videos for offline watching ☆39 Dec 4, 2013 Updated 12 years ago
- ☆14 May 21, 2026 Updated 3 weeks ago
- ☆44 Jul 24, 2025 Updated 10 months ago
- A benchmark of Python client libraries for Apache Kafka (pycontw 2017) ☆17 Jun 27, 2017 Updated 8 years ago
- Setting up macOS the fast way. Docs available at https://i3p9.github.io/mac-setup/ ☆12 Oct 15, 2023 Updated 2 years ago
- Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more. ☆10 Jan 23, 2023 Updated 3 years ago
- Supervisor trees for Go ☆10 Nov 4, 2017 Updated 8 years ago
- Kubernetes Workshop Material ☆16 May 10, 2019 Updated 7 years ago
- ☆10 Nov 23, 2020 Updated 5 years ago
- Quickly set up a POC environment for Kafka+Spark ☆14 Oct 10, 2017 Updated 8 years ago
- A Realtime Analytics Engine using Kafka, Spark & MongoDB ☆15 Feb 28, 2017 Updated 9 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar… ☆51 Dec 4, 2023 Updated 2 years ago
- Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary… ☆31 Jul 6, 2021 Updated 4 years ago
- Domain-driven design in Go ☆14 Aug 18, 2020 Updated 5 years ago
- ☆10 Feb 10, 2017 Updated 9 years ago
- B19415 - The Definitive Guide to Data Integration ☆11 Apr 15, 2024 Updated 2 years ago
- A full microservice architecture with Java, Spring Cloud, Log management with ELK, Server load balancing with Nginx, Infrastructure manag… ☆459 Apr 19, 2024 Updated 2 years ago
- ☆15 Jan 16, 2018 Updated 8 years ago
- An end-to-end data pipeline extracting music listening habits and producing an insightful dashboard ☆18 Mar 31, 2024 Updated 2 years ago
- Source code of my Eleventy-powered website ☆43 Jun 3, 2026 Updated last week
- Scraping and analyzing TikTok videos ☆11 Feb 19, 2023 Updated 3 years ago
- A semantic web crawler ☆20 Sep 20, 2010 Updated 15 years ago
- Delta Lake Examples ☆11 Apr 24, 2020 Updated 6 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA… ☆45 Jan 4, 2024 Updated 2 years ago
- ☆16 Sep 13, 2016 Updated 9 years ago
- SKOS Support for Apache Lucene and Solr ☆55 May 12, 2021 Updated 5 years ago
- Distributed Programming Assignments in Golang ☆12 Sep 13, 2016 Updated 9 years ago
- Streamdata.io Stock Market Data Streaming To AWS S3 Data Lake Using Lambda Serverless ☆11 Jul 14, 2018 Updated 7 years ago
- Use Confluent KSQL with Node.js and Socket.IO to push data to Chart.js ☆28 Aug 18, 2020 Updated 5 years ago
- Activator showing integration of AspectJ to monitor the ActorSystem ☆14 May 31, 2017 Updated 9 years ago