A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets(Lambda Architecture)
☆189Mar 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for big-data-pipeline-lambda-arch
Users that are interested in big-data-pipeline-lambda-arch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13May 3, 2022Updated 3 years ago
- Simple demo implementation of Lambda and Kappa architectures using Python, Docker, Kafka, Spark and Cassandra☆40Mar 15, 2018Updated 8 years ago
- Kafka in a Container☆14Dec 30, 2021Updated 4 years ago
- Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python☆22Mar 8, 2020Updated 6 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- InfluxDB 2 Connector for Kafka☆13Mar 6, 2020Updated 6 years ago
- ☆12Sep 6, 2018Updated 7 years ago
- An open-source backtesting and live trading platform for using to foreign exchange☆76Jan 3, 2025Updated last year
- This application receives messages from a mqtt broker and sends the messages to a kafka cluster. Topic mapping is configurable.☆29Dec 25, 2025Updated 3 months ago
- A Google Chrome extension to download Udacity.com videos for offline watching☆40Dec 4, 2013Updated 12 years ago
- NoSQL extract, transform, load (ETL) toolkit with Python☆15Apr 4, 2026Updated last week
- ☆75Jun 27, 2020Updated 5 years ago
- Supervisor trees for Go☆11Nov 4, 2017Updated 8 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A personal finance app.☆33Jun 3, 2016Updated 9 years ago
- ☆10Feb 10, 2017Updated 9 years ago
- Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary…☆31Jul 6, 2021Updated 4 years ago
- A full microservice architecture with Java, Spring Cloud, Log management with ELK, Server load balancing with Nginx, Infrastructure manag…☆455Apr 19, 2024Updated last year
- GXC-CMS Application Template☆30Jun 21, 2012Updated 13 years ago
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time.☆14Mar 1, 2026Updated last month
- 🌟 An end-to-end full-stack data science project, including modelling, MLOps, and data storytelling. ✨☆16Aug 30, 2025Updated 7 months ago
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Jan 4, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16Sep 13, 2016Updated 9 years ago
- Use Confluent KSQL with node.js and socket.io to PUSH data to chartjs☆28Aug 18, 2020Updated 5 years ago
- Simple Doodle Classifier written in python☆13May 19, 2018Updated 7 years ago
- Linux, Container, Infrastructure related resources and blogs☆14Jul 14, 2024Updated last year
- Sample code for working with HBase Thrift.☆15Jul 25, 2013Updated 12 years ago
- Batch-clip multiple web pages from a text file.☆21Jul 22, 2019Updated 6 years ago
- ☆10Jun 13, 2018Updated 7 years ago
- A tiny library to allow your RESTful resources to be expanded and/or filtered.☆29Oct 12, 2014Updated 11 years ago
- StarCraft 2 Data Pipeline with Airflow, DuckDB and Streamlit☆16Mar 14, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- scikit-learn: machine learning in Python☆13Mar 14, 2025Updated last year
- wasmCloud project templates - use with 'wash new'☆13Dec 27, 2023Updated 2 years ago
- Annotation for structural pattern matching using VAVR☆16Mar 9, 2026Updated last month
- Flask app to calculate compensation of a data scientist☆12Dec 27, 2022Updated 3 years ago
- Go & React Template☆12Oct 13, 2023Updated 2 years ago
- Event listener provider for Keycloak that writes to Kafka☆11Jun 11, 2019Updated 6 years ago
- AWS Solutions Architect Associate (SAA-C02) Exam Prep Course - 2020 UPDATED!, published by Packt☆15Sep 3, 2020Updated 5 years ago