volga-project / volga
Data Processing/Feature Calculation Engine for real-time AI/ML
☆40Updated this week
Alternatives and similar repositories for volga:
Users that are interested in volga are comparing it to the libraries listed below
- Open Benchmarks for Evaluating the Performance of Feature Stores☆35Updated 11 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 8 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆100Updated last month
- Python stream processing for analytics☆31Updated last week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆190Updated this week
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.☆99Updated this week
- ☆84Updated last month
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆47Updated last year
- Next generation compute platform for the post-modern data stack☆13Updated this week
- The Internals of PySpark☆25Updated last month
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆75Updated this week
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower☆15Updated this week
- Apache DataFusion Ray☆158Updated this week
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆49Updated 2 years ago
- Ray-based Apache Beam runner☆43Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 6 months ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆16Updated this week
- ☆35Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆76Updated 4 months ago
- Friendly ML feature store☆45Updated 2 years ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆49Updated 9 months ago
- Demos of Materialize, the operational data warehouse.☆51Updated 5 months ago
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆79Updated 2 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- ☆34Updated 11 months ago
- Distributed SQL Query Engine in Python using Ray☆243Updated 4 months ago