volga-project / volga
Feature Engine for real-time AI/ML
☆36Updated last week
Related projects ⓘ
Alternatives and complementary repositories for volga
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆17Updated last year
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆46Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆95Updated last month
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t…☆165Updated 2 weeks ago
- Apache DataFusion Ray☆117Updated this week
- Open Benchmarks for Evaluating the Performance of Feature Stores☆35Updated 8 months ago
- ☆76Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 9 months ago
- A write-audit-publish implementation on a data lake without the JVM☆41Updated 3 months ago
- real-time data + ML pipeline☆54Updated this week
- ☆10Updated last year
- Arrow, pydantic style☆82Updated last year
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower☆11Updated this week
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated 8 months ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.☆95Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆171Updated 10 months ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 4 months ago
- Python binding for DataFusion☆59Updated 2 years ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆45Updated last year
- Demos of Materialize, the operational data warehouse.☆50Updated 2 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆74Updated last month
- A python library bakeoff for medium sized datasets☆24Updated last year
- ☆27Updated last year
- ☆130Updated 2 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- ☆30Updated 3 years ago
- Python stream processing for analytics☆24Updated this week
- A work-in-progress book on Dask☆12Updated last year
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- An experimental Athena extension for DuckDB 🐤☆50Updated 9 months ago