A multi-modal Python library for benchmarking lakehouse engines and ELT scenarios, supporting both industry-standard and novel benchmarks.
☆45Mar 12, 2026Updated last week
Alternatives and similar repositories for LakeBench
Users that are interested in LakeBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Dec 1, 2025Updated 3 months ago
- A Model Context Protocol (MCP) server that enables AI assistants to securely access and analyze Microsoft Fabric Analytics data through a…☆106Dec 23, 2025Updated 3 months ago
- ☆47Mar 3, 2026Updated 2 weeks ago
- dbt adapter for dbt serverless pools☆13Mar 1, 2023Updated 3 years ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆24Mar 9, 2026Updated 2 weeks ago
- ☆15May 27, 2025Updated 9 months ago
- ☆16Mar 5, 2025Updated last year
- ☆12Oct 17, 2022Updated 3 years ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 9 months ago
- ☆57Updated this week
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse.☆16May 11, 2021Updated 4 years ago
- The source for REST API specifications for Microsoft Fabric.☆35Updated this week
- Python Package for ducklake☆20Jun 5, 2025Updated 9 months ago
- ☆18Feb 16, 2022Updated 4 years ago
- FabricFlow is a code-first Python SDK for building, managing, and automating Microsoft Fabric data pipelines, workspaces, and core items.…☆25Jan 16, 2026Updated 2 months ago
- ☆15Aug 28, 2025Updated 6 months ago
- Simple example on how to setup Opentelemetry to aid you during development☆14Oct 3, 2022Updated 3 years ago
- Project for open sources demos for Excel Mirroring using Open Mirroing☆21Dec 2, 2024Updated last year
- Data Catalog metadata search service (part of DataHub data management system)☆10Jan 30, 2023Updated 3 years ago
- Track Peer location using socket.io client and browser geolocation api☆11May 30, 2021Updated 4 years ago
- The repository contains Python code for a data pipeline that ingests files from Azure Blob Storage, extracts and chunks text, redacts sen…☆14Jan 13, 2026Updated 2 months ago
- Virtual Enigma and Bombe eBPF simulation for real-time network packet encryption and cryptanalysis on Linux☆18Aug 4, 2025Updated 7 months ago
- Monad flavored Hardhat☆14Nov 23, 2025Updated 4 months ago
- conversion between PySpark and Polars DataFrames☆21Updated this week
- This repository supports the content for the book of the same name from Manning☆14Nov 17, 2025Updated 4 months ago
- A Postgres extension that rewrites strings to 💩☆21May 14, 2023Updated 2 years ago
- Retail Search with AI☆14Feb 14, 2026Updated last month
- React CodeGen using GPT☆12Feb 11, 2024Updated 2 years ago
- Data Package reader for Pandas☆19Feb 10, 2023Updated 3 years ago
- AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi…☆10Nov 3, 2023Updated 2 years ago
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆14Mar 14, 2026Updated last week
- ☆21May 15, 2021Updated 4 years ago
- An easy interface for documenting data packages☆20Apr 12, 2018Updated 7 years ago
- Decompiler for Power Query code☆20Mar 29, 2018Updated 7 years ago
- This application allows you to import your data, transform it using AI to perform calculations, and report and export the results.☆39Mar 8, 2026Updated 2 weeks ago
- GitHub Action for Continuous Profiling which you can run to profile your CI/CD. It uses parca and Polar Signals cloud.☆15Feb 10, 2026Updated last month
- A cloud native data mesh implementation☆12Jan 15, 2021Updated 5 years ago
- ☆23May 26, 2025Updated 9 months ago
- Example GitHub Actions for Apache Kafka client application development for local and Confluent Cloud☆15Aug 1, 2022Updated 3 years ago