An end-to-end workflow for processing streaming data on Azure.
☆17Sep 20, 2024Updated last year
Alternatives and similar repositories for stream-iot
Users that are interested in stream-iot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Image scraper for DuckDuckGo and Google for creating DL datasets☆22Sep 18, 2020Updated 5 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆146Aug 23, 2023Updated 2 years ago
- Steve's coffee shop recipe project for the Pluralsight Course "Git Fundamentals"☆21Mar 13, 2023Updated 3 years ago
- ☆16Jan 19, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆20Jan 23, 2023Updated 3 years ago
- A micro cluster lab to experiment Dask and Spark (Python and Scala) based on Docker☆16Mar 7, 2023Updated 3 years ago
- Jupyter Notebook that demonstrates LLM-driven voter file data engineering, using LangChain to write SQL and Python☆36Apr 4, 2023Updated 3 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 3 months ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 8 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Cloud based Data Platform based on Apache Spark☆28Apr 24, 2026Updated last week
- CLI tool for transforming collections of tabular source data into a variety of text-based data formats via YAML configuration and Jinja t…☆26Apr 16, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Dec 8, 2025Updated 4 months ago
- 🟣 Recommendation Systems interview questions and answers to help you prepare for your next machine learning and data science interview i…☆62Jan 4, 2026Updated 4 months ago
- Provides syntax highlighting for Apptainer/Singularity definition files.☆10Dec 24, 2025Updated 4 months ago
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- A Twisted-based Kubernetes client.☆12Dec 18, 2018Updated 7 years ago
- Simulate and visualize the processing of food orders☆11Feb 2, 2024Updated 2 years ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 6 months ago
- ☆12Aug 1, 2025Updated 9 months ago
- ☆36Jun 9, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Spyrk-cluster is a data mini-lab, considering the main technologies used these days. It's useful to either understand how to configure a …☆28Apr 7, 2021Updated 5 years ago
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated last year
- benchmarks for LLM tokenizers☆18Mar 25, 2026Updated last month
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- This project leverages GCS, Composer, Dataflow, BigQuery, and Looker on Google Cloud Platform (GCP) to build a robust data engineering so…☆35Dec 12, 2023Updated 2 years ago
- Always-on AI agent orchestration in the cloud. Deploy Gas Town to a VPS, access from anywhere via Tailscale.☆31Jan 4, 2026Updated 4 months ago
- sketch + search = skrch☆21Apr 4, 2013Updated 13 years ago
- Application for checking performance of elevator group system in building using simulation method.☆12Nov 9, 2017Updated 8 years ago
- ☆18Nov 11, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Evolution of Discrete data with Reinforcement Learning☆13Dec 8, 2019Updated 6 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Feb 12, 2016Updated 10 years ago
- A database with automatic dynamic imputation of missing values.☆11Nov 2, 2017Updated 8 years ago
- C library for efficient string matching with Aho-Corasick☆21Jan 20, 2012Updated 14 years ago
- Professional Wargaming LLM Toolbox☆25Jul 9, 2025Updated 9 months ago
- Datasets for CS109☆28Oct 8, 2013Updated 12 years ago
- Deploy a distroless Rust API to Azure☆23Jan 19, 2026Updated 3 months ago