ColinEberhardt / awesome-public-streaming-datasets
A list of free datasets that provide streaming data
☆433 May 16, 2024 Updated last year
Alternatives and similar repositories for awesome-public-streaming-datasets
Users who are interested in awesome-public-streaming-datasets are comparing it to the repositories listed below
- A list of publicly available datasets with real-time data maintained by the team at bytewax.io ☆2,289 Dec 21, 2025 Updated last month
- A service implementing the Carbon protocol and storing time series data using kairos ☆42 Mar 11, 2021 Updated 4 years ago
- The klient utility provides a cli for basic kafka cluster operations and topic IO ☆16 Aug 21, 2025 Updated 5 months ago
- ☆17 Sep 13, 2021 Updated 4 years ago
- Blazing fast and flexible JSON database. ☆24 Jan 2, 2017 Updated 9 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store ☆17 Oct 20, 2022 Updated 3 years ago
- Gonudb is an append-only key/value datastore written in Go. ☆19 Dec 11, 2023 Updated 2 years ago
- Improving your general programming knowledge ☆14 Oct 22, 2017 Updated 8 years ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆535 Jan 27, 2026 Updated 2 weeks ago
- ☆23 May 12, 2023 Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆45 Dec 11, 2023 Updated 2 years ago
- A python tool scraping Aiven services metadata and building a connected graph ☆14 Aug 28, 2025 Updated 5 months ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you… ☆11 Nov 18, 2025 Updated 2 months ago
- CLI tool to manage Kafka connectors ☆10 Mar 2, 2024 Updated last year
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components ☆10 Oct 11, 2019 Updated 6 years ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase ☆11 Jul 13, 2023 Updated 2 years ago
- ☆23 Apr 2, 2017 Updated 8 years ago
- Python backend for "Data Query Languages" (like GraphQL and others) ☆24 Aug 23, 2015 Updated 10 years ago
- A small library that allows to check if Go mutexes are locked ☆27 May 14, 2025 Updated 9 months ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! ☆837 Apr 16, 2022 Updated 3 years ago
- HDFS Automatic Snapshot Service for Linux ☆11 Oct 17, 2016 Updated 9 years ago
- Content Data Store (HDFS/HBase) ☆13 Dec 1, 2016 Updated 9 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP! ☆12 Jul 6, 2023 Updated 2 years ago
- Prescriptive Applications over Kite and Hadoop ☆12 Oct 14, 2015 Updated 10 years ago
- 🍱 bento is an English-based automation language designed to be used by non-technical people. ☆32 Aug 7, 2019 Updated 6 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more ☆391 Nov 28, 2023 Updated 2 years ago
- A book, Let's build a DBMS: StellarSQL -- a minimal SQL DBMS written in Rust ☆27 Nov 8, 2018 Updated 7 years ago
- Full stack cloud applications that combine infrastructure as code and front end codebases for cohesive end to end applications and exampl… ☆15 Aug 17, 2020 Updated 5 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt ☆13 Jan 18, 2023 Updated 3 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de… ☆11 Nov 18, 2023 Updated 2 years ago
- A simple UI for Benthos! ☆15 Jul 21, 2022 Updated 3 years ago
- declarative API testing ☆14 Sep 21, 2020 Updated 5 years ago
- resources for career development in data science ☆16 Jun 24, 2020 Updated 5 years ago
- A crowd sourced curriculum of mandatory material for new front-end devs. ☆48 Feb 8, 2016 Updated 10 years ago
- PyTorch implementation of pix2code. 🔥 ☆27 Mar 23, 2018 Updated 7 years ago
- Lifecycle helpers for loading and unmounting css ☆15 Jun 19, 2025 Updated 7 months ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate. ☆38 Feb 3, 2026 Updated last week
- Tweet Analysis with Spark ☆14 Aug 28, 2017 Updated 8 years ago
- A book about Maven in the style of the Pragmatic Guides published by The Pragmatic Bookshelf ☆11 Dec 12, 2015 Updated 10 years ago