neuralinkcorp / datarepo
☆147 · Updated 2 months ago
Alternatives and similar repositories for datarepo
Users who are interested in datarepo are comparing it to the libraries listed below.
- Build reliable AI and agentic applications with DataFrames ☆415 · Updated this week
- High Performance Data Processing in Python ☆337 · Updated this week
- Curated examples and patterns for using Chalk. Use these to build your feature pipelines. ☆25 · Updated last month
- ☆115 · Updated 2 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆85 · Updated last year
- a compute manifest and tools for ML ☆463 · Updated this week
- A playground for running duckdb as a stateless query engine over a data lake ☆216 · Updated last year
- Unity Catalog UI ☆43 · Updated last year
- CookieCutter template for getting started with Flyte python projects ☆21 · Updated 8 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 · Updated last year
- Iceberg Playground in a Box ☆67 · Updated 5 months ago
- Official Python API client library for turbopuffer ☆91 · Updated this week
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆95 · Updated 8 months ago
- Training and evaluating encoding models to predict fMRI brain responses to naturalistic video stimuli ☆292 · Updated 2 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆95 · Updated 9 months ago
- Open-source repository for Semantic Modeling Language (SML) ☆119 · Updated last week
- Real-time data processing/feature engineering in Rust and Python. Tailored for modern AI/ML systems. ☆72 · Updated last week
- ☆59 · Updated last year
- Open Control Plane for Tables in Data Lakehouse ☆375 · Updated this week
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa… ☆186 · Updated last week
- ☆62 · Updated 7 months ago
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com ☆60 · Updated this week
- Experimental version. A BYOC option for Snowflake workloads ☆102 · Updated this week
- [SIGMOD 2026] F3: The Open-Source Data File Format for the Future ☆294 · Updated last month
- dbc is a command-line tool for installing and managing ADBC drivers ☆65 · Updated this week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine. ☆14 · Updated 9 months ago
- A FastMCP tool to search and retrieve Polars API documentation. ☆71 · Updated 6 months ago
- Prepare requirements and deploy Flyte using Helm ☆79 · Updated 8 months ago
- Easy to use cluster-compute software. ☆214 · Updated this week
- 🛡️ Managed isolated environments for Python ☆105 · Updated 2 weeks ago