treeverse / lakeFS
lakeFS - Data version control for your data lake | Git for data
☆4,354 · Updated this week
Related projects:
- Compare tables within or across databases ☆2,933 · Updated 4 months ago
- The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data. ☆5,723 · Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆3,789 · Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr… ☆1,779 · Updated this week
- Apache DataFusion SQL Query Engine ☆5,913 · Updated this week
- Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface. ☆1,871 · Updated 3 weeks ago
- re_data - fix data issues before your users & CEO would discover them 😊 ☆1,540 · Updated 4 months ago
- An orchestration platform for the development, production, and observation of data assets. ☆11,155 · Updated this week
- Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time E… ☆6,783 · Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆984 · Updated this week
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) ☆10,189 · Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks. ☆5,418 · Updated this week
- Apache Iceberg ☆6,161 · Updated this week
- Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://data… ☆7,684 · Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,391 · Updated last week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code. ☆1,639 · Updated this week
- A native Rust library for Delta Lake, with bindings into Python ☆2,155 · Updated this week
- Malloy is an experimental language for describing data relationships and transformations.