volga-project / volga
Data Processing/Feature Calculation Engine for Real-Time AI/ML
☆40 Updated this week
Alternatives and similar repositories for volga:
Users who are interested in volga are comparing it to the libraries listed below.
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- Open Benchmarks for Evaluating the Performance of Feature Stores ☆35 Updated 11 months ago
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen… ☆18 Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis. ☆102 Updated 2 months ago
- Python stream processing for analytics ☆34 Updated 3 weeks ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. ☆21 Updated 9 months ago
- ☆10 Updated 2 years ago
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower ☆15 Updated last week
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆31 Updated 2 years ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations ☆47 Updated 2 years ago
- ☆85 Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated 2 weeks ago
- do-anything, run-anywhere pandas-style pipelines ☆94 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆195 Updated this week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆62 Updated 2 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆76 Updated 5 months ago
- ☆22 Updated this week
- A software engineering framework to jump start your machine learning projects ☆37 Updated 8 months ago
- Unity Catalog UI ☆39 Updated 6 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 7 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated last year
- Apache DataFusion Ray ☆168 Updated this week
- ☆55 Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated 11 months ago
- The Internals of PySpark ☆26 Updated 2 months ago
- ☆36 Updated this week
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… ☆48 Updated this week