dremio/dremio-oss

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dremio/dremio-oss)

dremio / dremio-oss

Dremio - the missing link in modern data

☆1,487

Alternatives and similar repositories for dremio-oss

Users that are interested in dremio-oss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

projectnessie / nessie
View on GitHub
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,481Updated this week
dremio / arrow
View on GitHub
Mirror of Apache Arrow
☆33Jun 16, 2026Updated last month
dremio / dremio-cloud-tools
View on GitHub
Dremio Container Tools
☆165Aug 26, 2025Updated 10 months ago
dremio / gandiva
View on GitHub
Vectorized processing for Apache Arrow
☆484Feb 14, 2022Updated 4 years ago
delta-io / delta
View on GitHub
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,917Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
apache / arrow
View on GitHub
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
☆16,941Updated this week
Teradata / kylo
View on GitHub
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…
☆1,111Jan 12, 2023Updated 3 years ago
apache / drill
View on GitHub
Apache Drill is a distributed MPP query layer for self describing data
☆2,020Updated this week
apache / iceberg
View on GitHub
Apache Iceberg
☆9,062Updated this week
trinodb / trino
View on GitHub
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
☆13,046Updated this week
apache / calcite
View on GitHub
Apache Calcite
☆5,157Updated this week
apache / hudi
View on GitHub
Upserts, Deletes And Incremental Processing on Big Data.
☆6,193Updated this week
prestodb / presto
View on GitHub
The official home of the Presto distributed SQL query engine for big data
☆16,719Updated this week
apache / pinot
View on GitHub
Apache Pinot - A realtime distributed OLAP datastore
☆6,116Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apache / datafusion
View on GitHub
Apache DataFusion SQL Query Engine
☆8,996Updated this week
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,352Updated this week
amundsen-io / amundsen
View on GitHub
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…
☆4,782Jul 1, 2026Updated 2 weeks ago
TIBCOSoftware / snappydata
View on GitHub
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032Nov 21, 2022Updated 3 years ago
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,212Apr 29, 2025Updated last year
dremio-hub / dremio-flight-connector
View on GitHub
Dremio Flight connector. Access Dremio using Arrow flight
☆38Dec 11, 2020Updated 5 years ago
substrait-io / substrait
View on GitHub
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,535Updated this week
dbt-labs / dbt-core
View on GitHub
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…
☆13,475Updated this week
apache / druid
View on GitHub
Apache Druid: a high performance real-time analytics database.
☆14,033Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
airbytehq / airbyte
View on GitHub
Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both …
☆21,654Updated this week
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,576Updated this week
OpenLineage / OpenLineage
View on GitHub
An Open Standard for lineage metadata collection
☆2,549Updated this week
treeverse / lakeFS
View on GitHub
lakeFS - Data version control for your data lake | Git for data
☆5,454Updated this week
heavyai / heavydb
View on GitHub
HeavyDB (formerly MapD/OmniSciDB)
☆3,055Jun 25, 2026Updated 3 weeks ago
datahub-project / datahub
View on GitHub
The Context Platform for your Data and AI Stack
☆12,298Updated this week
databendlabs / databend
View on GitHub
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.
☆9,390Updated this week
MaterializeInc / materialize
View on GitHub
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
☆6,334Updated this week
dremio / dbt-dremio
View on GitHub
dbt (data build tool) adapter for the Dremio
☆57Jun 9, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MarquezProject / marquez
View on GitHub
Collect, aggregate, and visualize a data ecosystem's metadata
☆2,242Jul 6, 2026Updated last week
apache / ignite
View on GitHub
Apache Ignite
☆5,073Updated this week
apache / polaris
View on GitHub
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆2,017Updated this week
uber-archive / AthenaX
View on GitHub
SQL-based streaming analytics platform at scale
☆1,224Jun 21, 2020Updated 6 years ago
fabrice-etanchaud / dbt-dremio
View on GitHub
dbt's adapter for dremio
☆48Oct 15, 2022Updated 3 years ago
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907Updated this week
debezium / debezium
View on GitHub
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
☆12,931Updated this week