neuralinkcorp / datarepoLinks
☆128Updated 2 weeks ago
Alternatives and similar repositories for datarepo
Users that are interested in datarepo are comparing it to the libraries listed below
Sorting:
- High-Performance Python Compute Engine for Data and AI☆306Updated this week
- Build reliable AI and agentic applications with DataFrames☆345Updated this week
- multi-engine batch transformation framework☆448Updated last week
- Curated examples and patterns for using Chalk. Use these to build your feature pipelines.☆21Updated 2 months ago
- Iceberg Playground in a Box☆67Updated 3 months ago
- Official Python API client library for turbopuffer☆78Updated last week
- Open Control Plane for Tables in Data Lakehouse☆369Updated this week
- Open-source repository for Semantic Modeling Language (SML)☆101Updated this week
- Unity Catalog UI☆43Updated last year
- ☆112Updated 3 weeks ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- [SIGMOD 2026] F3: The Open-Source Data File Format for the Future☆194Updated last week
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆176Updated last week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆245Updated last week
- A curated list of awesome DBOS resources 😎☆79Updated 2 months ago
- Write-Audit-Publish on the lakehouse in pure Python with bauplan and DBOS☆13Updated 9 months ago
- DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles☆54Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- ☆58Updated last year
- Training and evaluating encoding models to predict fMRI brain responses to naturalistic video stimuli☆282Updated 2 weeks ago
- A playground for running duckdb as a stateless query engine over a data lake☆211Updated last year
- A Benchmark for Real-Time Analytics Applications☆72Updated 3 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆93Updated 6 months ago
- Real-time data processing/feature engineering in Rust and Python. Tailored for modern AI/ML systems.☆70Updated this week
- ☆83Updated last year
- Autodiff in rust☆60Updated last week
- The Control Plane for Apache Iceberg.☆362Updated 2 weeks ago
- Quick overview of duckdb, pandas and polars through a simple data pipeline.☆13Updated 2 years ago
- A FastMCP tool to search and retrieve Polars API documentation.☆68Updated 4 months ago
- ☆59Updated 5 months ago