neuralinkcorp / datarepoLinks
☆159Updated 4 months ago
Alternatives and similar repositories for datarepo
Users that are interested in datarepo are comparing it to the libraries listed below
Sorting:
- Official Python API client library for turbopuffer☆102Updated this week
- A compute manifest and composable tools for ML, built on Ibis, DataFusion, and Arrow Flight.☆481Updated this week
- Training and evaluating encoding models to predict fMRI brain responses to naturalistic video stimuli☆295Updated 4 months ago
- Declarative context engineering for agents☆434Updated last week
- [SIGMOD 2026] F3: The Open-Source Data File Format for the Future☆396Updated 2 months ago
- ☆117Updated last week
- High Performance Data Processing in Python☆351Updated last week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Updated last year
- Embeddable stream processing engine based on Apache DataFusion☆373Updated last year
- Iceberg Playground in a Box☆67Updated 7 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆96Updated 10 months ago
- Unity Catalog UI☆43Updated last year
- Reusable data engineering toolkit My personal data infrastructure☆19Updated 3 months ago
- New file format for storage of large columnar datasets.☆686Updated this week
- Real-time data processing/feature engineering in Rust and Python. Tailored for modern AI/ML systems.☆74Updated this week
- Autodiff in rust☆60Updated 3 months ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Updated 11 months ago
- RAG based agent with chDB(ClickHouse)☆22Updated 8 months ago
- ☆131Updated 6 months ago
- A curated list of awesome DBOS resources 😎☆93Updated 3 months ago
- Run Graph Queries with Lance☆91Updated last week
- Template to quickstart streaming analytics using Apache Kafka for ingestion, QuestDB for time-series storage and analytics, Grafana for n…☆103Updated 8 months ago
- Apache Hive Metastore in Standalone Mode With Docker☆14Updated last year
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale☆137Updated 2 months ago
- dbc is a command-line tool for installing and managing ADBC drivers☆79Updated last week
- Quick overview of duckdb, pandas and polars through a simple data pipeline.☆13Updated 2 years ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆267Updated last week
- The CLI for GPUs☆141Updated 2 months ago
- A catalogue of existing Nanda servers☆190Updated 9 months ago
- Open Control Plane for Tables in Data Lakehouse☆379Updated last week