An example repository showing how to leverage Kafka to stream your data
☆21May 11, 2024Updated last year
Alternatives and similar repositories for real_time_streaming_pipeline
Users that are interested in real_time_streaming_pipeline are comparing it to the libraries listed below
Sorting:
- learning-by-doing data model built with dbt-core☆15Dec 13, 2025Updated 2 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated last year
- prebuilt configurations for docker-rpm-builder☆11Feb 5, 2021Updated 5 years ago
- Github action for running python unit tests☆10Jun 16, 2025Updated 8 months ago
- Playground site for creating/validating data contracts☆11Aug 9, 2025Updated 7 months ago
- Manage Unity Catalog tables with Pydantic Models☆10Mar 5, 2025Updated last year
- Architecture principles☆13May 23, 2025Updated 9 months ago
- Extremely low-level wrapper to the MediaWiki API☆27Mar 15, 2017Updated 8 years ago
- ETL Pipeline using Luigi☆10Nov 15, 2017Updated 8 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 3 months ago
- ☆13Updated this week
- ☆10Aug 7, 2019Updated 6 years ago
- Databricks dbt factory library for creating Databricks Job definition where individual dbt models are run as separate tasks.☆20Jul 13, 2025Updated 7 months ago
- ☆13Nov 14, 2013Updated 12 years ago
- Operating documents for the technical steering committee.☆15Updated this week
- A collection of CMake modules to simplify the development of Boost libraries.☆10Apr 16, 2012Updated 13 years ago
- An implementation of Dijkstra in Clojure☆19Aug 7, 2012Updated 13 years ago
- Ansible Role for JBoss/Wildfly and JBoss-based products.☆11Oct 7, 2019Updated 6 years ago
- Parsing Module of Microsoft SQL Server Transaction log☆11May 12, 2023Updated 2 years ago
- An example for using `uv` with Ansible. It provides a local development Docker container and also a remote production container.☆15May 5, 2025Updated 10 months ago
- A simple project to analyse Malaysian airports - an opportunity to play with tools like Luigi, Docker, and Metabase as part of an end-to-…☆13Jul 25, 2023Updated 2 years ago
- ☆11Nov 11, 2024Updated last year
- Tabela Brasileira de Composição de Alimentos / Brazilian Food Composition Table☆16Nov 18, 2021Updated 4 years ago
- Bio-FlatScope reconstruction code☆13Mar 21, 2023Updated 2 years ago
- ☆12Aug 9, 2024Updated last year
- Fixed-width data source for Spark SQL and DataFrames☆10Oct 25, 2016Updated 9 years ago
- An sbt plugin to resolve dependencies using Aether☆13Apr 10, 2025Updated 10 months ago
- Arena allocator for Python objects.☆14Apr 25, 2020Updated 5 years ago
- CentOS docker images, build weekly with latest security updates☆11Mar 2, 2026Updated last week
- ☆11Jul 20, 2023Updated 2 years ago
- ☆12Jun 5, 2015Updated 10 years ago
- A fast way of getting a Spark cluster up and running on AWS with the friendly IPython interface.☆10May 8, 2015Updated 10 years ago
- Trino Iceberg Metadata Insights via Streamlit☆15Apr 9, 2025Updated 11 months ago
- This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as ou…☆13Dec 11, 2023Updated 2 years ago
- 🧛🏻♂️ Dark theme for Rio Terminal☆18Sep 21, 2025Updated 5 months ago
- PredictHQ’s Data Science documentation☆14Feb 1, 2026Updated last month
- Knowledge sharing - Cheat sheets☆20Feb 28, 2026Updated last week
- GeonamesDump import data from geonames.org into your rails application☆19Apr 14, 2017Updated 8 years ago