Build Data Lake using Open Source tools
☆126May 27, 2025Updated 10 months ago
Alternatives and similar repositories for openlake
Users that are interested in openlake are comparing it to the libraries listed below.
- Find bottlenecks in distributed network☆23Dec 8, 2020Updated 5 years ago
- Collection of assets used for various articles at https://blogs.min.io☆42Apr 9, 2026Updated last week
- Spark Streaming Checkpoint File Manager for MinIO☆11Apr 25, 2023Updated 2 years ago
- ☆16Mar 9, 2026Updated last month
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆45Mar 7, 2024Updated 2 years ago
- How to use Presto (with Hive metastore) and MinIO?☆28Mar 8, 2023Updated 3 years ago
- ☆14Jun 25, 2025Updated 9 months ago
- A Prometheus exporter for Minio cloud storage server☆23Apr 24, 2018Updated 7 years ago
- Play blindfold chess against any UCI compatible engines.☆12Dec 4, 2023Updated 2 years ago
- Collection of tests to detect overall correctness of MinIO server.☆102Jan 8, 2026Updated 3 months ago
- A thumbnail generator example using Minio's listenBucketNotification API☆105Jun 2, 2022Updated 3 years ago
- Miscellaneous codes and writings for MLOps☆15Apr 8, 2026Updated last week
- Extends `payloadcms` with the ability to login through Zitadel☆12Feb 25, 2026Updated last month
- sample project to use Minio as Laravel cloud file system☆17Sep 20, 2017Updated 8 years ago
- ☆18Oct 22, 2025Updated 5 months ago
- Helper for handling PySpark DataFrame partition size 📑🎛️☆12Mar 8, 2024Updated 2 years ago
- Drive performance measurement tool☆77Dec 29, 2025Updated 3 months ago
- ☆21Mar 25, 2024Updated 2 years ago
- Trino On K8S Via Helm & Metastore Workshop Querying Delta Tables☆12Jan 27, 2025Updated last year
- How to customize Tableau authentication using AWS Athena's JDBC Credentials Provider capabilities.☆14Jun 8, 2020Updated 5 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆30Feb 27, 2024Updated 2 years ago
- Collection of benchmarks captured for MinIO server.☆31Jan 12, 2018Updated 8 years ago
- The MinIO Admin Go Client SDK provides APIs to manage MinIO services☆119Updated this week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆311Updated this week
- trino monitoring with JMX metrics through Prometheus and Grafana☆17Aug 14, 2024Updated last year
- Apache Polaris Tools, additional tooling for Apache Polaris☆27Updated this week
- Cloud-native Trino (prestosql) + Hive + Minio + Superset☆24Nov 29, 2021Updated 4 years ago
- Ansible Role - Containers☆14Jun 15, 2022Updated 3 years ago
- An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.☆15Mar 16, 2026Updated last month
- Terraform module to manage Network Load Balancer resources within the Yandex.Cloud.☆12Mar 23, 2026Updated 3 weeks ago
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆48Oct 14, 2024Updated last year
- Simple code for running and visualizing replicator dynamics☆11Jan 31, 2024Updated 2 years ago
- Gadsme Helm chart repository☆56Nov 14, 2025Updated 5 months ago
- Determining the important factors that influence customer or passenger satisfaction with an airline using CRISP-DM methodology in Pyt…☆26Sep 1, 2023Updated 2 years ago
- This repo contains DAGs demonstrating a variety of ELT patterns using Airflow along with dbt.☆12Jan 12, 2023Updated 3 years ago
- Manage PCI Devices and PCI Device Claims for PCI Passthrough in Harvester☆19Apr 1, 2026Updated 2 weeks ago
- Ansible Collection for GitLab☆18Updated this week
- Ansible role to install the MinIO https://min.io☆28Apr 10, 2026Updated last week
- Set of Go tools to check different elements of your stack (SSL, SMTP, Permissions...)☆24Updated this week