volga-project / volga
Real-time data processing/feature engineering in Python. Tailored for modern AI/ML systems.
☆49Updated this week
Alternatives and similar repositories for volga:
Users that are interested in volga are comparing it to the libraries listed below
- ☆89Updated 3 weeks ago
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 2 months ago
- Apache DataFusion Ray☆183Updated 2 weeks ago
- Arrow, pydantic style☆82Updated 2 years ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆11Updated 2 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 6 months ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆47Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.☆100Updated this week
- ☆11Updated 2 years ago
- A FastMCP tool to search and retrieve Polars API documentation.☆27Updated this week
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18Updated last year
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆24Updated last year
- Python stream processing for analytics☆36Updated last month
- Python stream processing with RisingWave☆16Updated last week
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower☆16Updated this week
- deferred computational framework for multi-engine pipelines☆234Updated this week
- Demos of Materialize, the operational data warehouse.☆52Updated last month
- Open Benchmarks for Evaluating the Performance of Feature Stores☆35Updated last year
- A Python Client for Hive Metastore☆12Updated last year
- An open-source, community-driven REST catalog for Apache Iceberg!☆27Updated 9 months ago
- Apache Arrow Ballista Python bindings☆37Updated last year
- Python driver for Timeplus Enterprise or Timeplus Proton☆14Updated 4 months ago
- Template to quickstart streaming analytics using Apache Kafka for ingestion, QuestDB for time-series storage and analytics, Grafana for n…☆83Updated 4 months ago
- ☆22Updated last month
- Journeys between the two worlds of Python 🐍 and Rust 🦀☆39Updated this week
- Distributed SQL Query Engine in Python using Ray☆243Updated 6 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆234Updated this week
- The native Rust implementation for Apache Hudi, with Python API bindings.☆209Updated this week