Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets as easy as versioning code.
☆1,105 · Feb 27, 2026 · Updated last week
Alternatives and similar repositories for Oxen
Users who are interested in Oxen are comparing it to the libraries listed below.
- Deprecated: We moved this to Oxen-AI/Oxen core ☆241 · Aug 26, 2025 · Updated 6 months ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve… ☆6,123 · Updated this week
- PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement ☆10,732 · Feb 26, 2026 · Updated last week
- A query engine for any combination of data sources. Query your files and APIs as if they were databases! ☆2,837 · Updated this week
- An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data ☆10,274 · Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust ☆37,582 · Updated this week
- Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability. ☆14,473 · Updated this week
- Distributed stream processing engine in Rust ☆4,827 · Feb 25, 2026 · Updated last week
- Create full-fledged APIs for slowly moving datasets without writing a single line of code. ☆3,409 · Dec 23, 2025 · Updated 2 months ago
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale ☆5,268 · Updated this week
- A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI! ☆3,896 · Dec 4, 2024 · Updated last year
- Minimalist ML framework for Rust ☆19,509 · Updated this week
- A Python tool to visualize + enforce dependencies, using modular architecture 🌎 Open source 🐍 Installable via pip 🔧 Able to be adopted… ☆2,655 · Feb 24, 2026 · Updated last week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl… ☆29,247 · Updated this week
- Dolt – Git for Data ☆20,314 · Updated this week
- PyGWalker: Turn your dataframe into an interactive UI for visual analysis ☆15,660 · Updated this week
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li… ☆2,742 · Updated this week
- Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x… ☆15,917 · Updated this week
- Deep learning at the speed of light. ☆2,775 · Updated this week
- 🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications. ☆5,174 · Updated this week
- 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM. ☆21,149 · Feb 17, 2026 · Updated 2 weeks ago
- Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo. ☆10,925 · Updated this week
- 🤖 Just a command runner ☆31,732 · Feb 16, 2026 · Updated 2 weeks ago
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models ☆6,150 · Jun 24, 2024 · Updated last year
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,247 · Feb 25, 2026 · Updated last week
- Scratch is a swiss army knife for big data. ☆1,117 · Jul 19, 2024 · Updated last year
- A Git-compatible VCS that is both simple and powerful ☆26,259 · Updated this week
- 🦉 Data Versioning and ML Experiments ☆15,404 · Feb 27, 2026 · Updated last week
- Postgres with GPUs for ML/AI apps. ☆6,727 · Jul 1, 2025 · Updated 8 months ago
- A distributed system for running WebSocket services at scale. ☆2,013 · Jan 13, 2026 · Updated last month
- 🪓 Run Background Tasks at Scale ☆6,664 · Updated this week
- A data visualization and analytics component, especially well-suited for large and/or streaming datasets. ☆10,346 · Feb 24, 2026 · Updated last week
- Data Agent Ready Warehouse: One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3. ☆9,174 · Updated this week
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown ☆5,975 · Feb 18, 2026 · Updated 2 weeks ago
- Apache DataFusion SQL Query Engine ☆8,462 · Updated this week
- 📢 Laudspeaker is an Open Source Customer Engagement and Product Onboarding Platform. Open Source alternative to Braze / One Signal / C… ☆2,564 · Oct 13, 2025 · Updated 4 months ago
- A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents. ☆2,813 · Updated this week
- Linear algebra foundation for the Rust programming language ☆2,476 · Jan 26, 2026 · Updated last month
- The framework for building with WebAssembly (wasm). Easily & securely load wasm modules, move data, call functions, and build extensible … ☆5,445 · Feb 24, 2026 · Updated last week