A benchmark tool for lakehouses.
☆14Mar 12, 2023Updated 3 years ago
Alternatives and similar repositories for lakehouse-benchmark
Users that are interested in lakehouse-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Script that fetches comments from a TikTok post☆16Apr 27, 2023Updated 2 years ago
- ☆17Apr 8, 2023Updated 3 years ago
- challenge☆31Aug 27, 2020Updated 5 years ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.☆23Oct 15, 2024Updated last year
- Git Repo for EDW Best Practice Assets on the Lakehouse☆16Dec 11, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Lakehouse storage system benchmark☆79Feb 22, 2023Updated 3 years ago
- Scala solutions for hackerrank☆11Nov 20, 2016Updated 9 years ago
- A Memory-efficient Graph Store for Interactive Queries☆13Sep 1, 2021Updated 4 years ago
- Real time face recognition with tracking (mtcnn detection, kcf tracker, arcface loss)☆38Mar 11, 2019Updated 7 years ago
- Use JavaCPP and JavaCPP presets with ease. Base plugin for JavaCPP-related projects.☆39Dec 20, 2020Updated 5 years ago
- Read Typeclass For Scala☆13May 24, 2022Updated 3 years ago
- pku nlp toolkit☆10Jun 5, 2018Updated 7 years ago
- Scala News - A Community Crowd Sourced newsletter using RSS☆37Mar 23, 2026Updated 3 weeks ago
- A simple graph library for Scala☆10Feb 5, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,122Apr 9, 2026Updated last week
- SQL scripts, instructions for MySQL HeatWave benchmarking☆12Mar 17, 2024Updated 2 years ago
- This is OpenMLDB's Spark Distribution, which is particularly optimized for feature extraction. It includes a few novel techniques, such a…☆12Jul 30, 2024Updated last year
- Node.js kafka connect connector for prometheus☆13Dec 7, 2022Updated 3 years ago
- An minimal example for a full stack Scala app on Heroku☆10Mar 5, 2020Updated 6 years ago
- Sina News Crawler and Word Segmentation☆13Dec 20, 2017Updated 8 years ago
- Stream Data from Databricks Directly to PowerBI, and CosmosDB!☆12Sep 25, 2018Updated 7 years ago
- ☆13May 22, 2023Updated 2 years ago
- Full stack skeleton project using Akka-http, Scala.js, Laminar, Sloth, Boopickle☆15Sep 1, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Bringing Spire to Dotty/Scala 3☆14Feb 19, 2024Updated 2 years ago
- Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP☆10Jul 18, 2022Updated 3 years ago
- Implement node2vec algorithm using Spark 2 from: http://snap.stanford.edu/node2vec/☆11Jul 10, 2019Updated 6 years ago
- An interactive Agda tutorial☆21Updated this week
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆31Nov 9, 2023Updated 2 years ago
- Giter8 template of a Udash application.☆19Mar 21, 2023Updated 3 years ago
- Code that was used as an example during the Data+AI Summit 2020☆15Mar 8, 2021Updated 5 years ago
- Minimalistic example of using Laminar and Scala.js to publish static Github Pages website☆14Jan 29, 2025Updated last year
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Jan 13, 2017Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- List of playbooks to manage Ambari☆13Oct 3, 2018Updated 7 years ago
- A high throughput GC-MS analysis pipeline built on the Python PyMS library☆11Feb 12, 2018Updated 8 years ago
- NeurIPS 2020☆17Jun 18, 2021Updated 4 years ago
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- Databricks ML in Action, Published by Packt☆34Mar 2, 2026Updated last month
- A set of example build and release pipelines for deploying Python and Scala to Azure Databricks and HDInsight☆14Jun 4, 2020Updated 5 years ago
- R interface to Azure Data Explorer, aka Kusto☆19Sep 9, 2025Updated 7 months ago