neuralinkcorp / datarepoLinks
☆123Updated 2 weeks ago
Alternatives and similar repositories for datarepo
Users that are interested in datarepo are comparing it to the libraries listed below
Sorting:
- Build reliable AI and agentic applications with DataFrames☆255Updated this week
- High-Performance Python Compute Engine for Data and AI☆300Updated this week
- Official Python API client library for turbopuffer☆70Updated this week
- Catalog, compose, and ship multi-engine Python expressions.☆406Updated this week
- A curated list of awesome DBOS resources 😎☆76Updated 3 weeks ago
- Embeddable stream processing engine based on Apache DataFusion☆349Updated 8 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆81Updated 11 months ago
- Iceberg Playground in a Box☆62Updated 2 months ago
- Curated examples and patterns for using Chalk. Use these to build your feature pipelines.☆21Updated last month
- ☆82Updated last year
- A Doom-like game using DuckDB☆92Updated 4 months ago
- Real-time data processing/feature engineering in Python and Rust. Tailored for modern AI/ML systems.☆67Updated last week
- A FastMCP tool to search and retrieve Polars API documentation.☆68Updated 3 months ago
- ☆101Updated this week
- New file format for storage of large columnar datasets.☆603Updated this week
- The Control Plane for Apache Iceberg.☆338Updated 3 weeks ago
- The simplest way to run Python on lot's of computers.☆113Updated this week
- A Benchmark for Real-Time Analytics Applications☆70Updated last month
- Unity Catalog UI☆42Updated 11 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆212Updated last year
- An open-source, community-driven REST catalog for Apache Iceberg!☆29Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Open Control Plane for Tables in Data Lakehouse☆369Updated last week
- Apache Hive Metastore in Standalone Mode With Docker☆14Updated last year
- Quick overview of duckdb, pandas and polars through a simple data pipeline.☆13Updated 2 years ago
- 🛡️ Managed isolated environments for Python☆99Updated 2 months ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆173Updated last week
- A BYOC option for Snowflake workloads☆89Updated last week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆13Updated 6 months ago
- ☆48Updated 2 months ago