Expand your Training Limits! Generating Training Data for ML-based Data Management
☆16Jul 12, 2022Updated 3 years ago
Alternatives and similar repositories for data-farm
Users that are interested in data-farm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Randomized Dependence Coefficient in Python☆20May 12, 2019Updated 6 years ago
- Balsa is a learned SQL query optimizer. It tailor optimizes your SQL queries to find the best execution plans for your hardware and engin…☆146Jun 13, 2022Updated 3 years ago
- Scalytics Connect development environment, pre-build☆22Feb 21, 2024Updated 2 years ago
- PostBOUND is a research framework to prototype and benchmark database query optimizers☆25Apr 1, 2026Updated last week
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆12Aug 12, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Artifact evaluation for "E2Usd: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series" accepted by WWW'24☆13Jul 29, 2024Updated last year
- Paper repository for "SWIRL: Selection of Workload-aware Indexes using Reinforcement Learning" (EDBT 2022)☆40Jul 12, 2025Updated 9 months ago
- ☆20Apr 11, 2022Updated 4 years ago
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆74Nov 8, 2024Updated last year
- Apache Wayang is the first cross-platform data processing system.☆264Updated this week
- Medical natural language parsing and utility library☆14Dec 10, 2025Updated 4 months ago
- This is the source code of the SIGMOD paper: "How Good are Learned Cost Models, Really? Insights From Query Optimization Tasks"☆29Jan 21, 2026Updated 2 months ago
- Factorize Sum Product Network☆30Nov 16, 2022Updated 3 years ago
- ☆72Jan 20, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jul 13, 2022Updated 3 years ago
- This repository includes the code base used in the paper "Is Your Learned Query Optimizer Behaving As You Expect? A Machine Learning Pers…☆18Feb 22, 2024Updated 2 years ago
- Query-based Workload Forecasting for Self-Driving DBMS☆103Oct 8, 2022Updated 3 years ago
- ☆39Jul 6, 2023Updated 2 years ago
- A prototype implementation of Bao for PostgreSQL☆216Sep 17, 2024Updated last year
- PostgreSQL extension for recoding workload of specific database☆12Mar 21, 2016Updated 10 years ago
- ☆10Feb 12, 2021Updated 5 years ago
- PostgreSQL extension to log SQL statements for specific server processes.☆12Feb 17, 2024Updated 2 years ago
- R* Tree java implementation☆12Sep 1, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Study of Database Performance Sensitivity to Experiment Settings☆10May 31, 2022Updated 3 years ago
- ☆14Apr 24, 2023Updated 2 years ago
- A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries☆14Oct 25, 2020Updated 5 years ago
- Tools to help with LaTeX paper writing☆17Aug 15, 2023Updated 2 years ago
- 截取iPad上京东阅读的书籍,自动保存为pdf☆11Nov 27, 2018Updated 7 years ago
- ☆36Feb 11, 2025Updated last year
- ☆13Nov 8, 2021Updated 4 years ago
- PyTorch implementation of binary tree convolution☆49Jan 3, 2020Updated 6 years ago
- Scalable Graph Mining☆63Nov 13, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆62May 12, 2024Updated last year
- The casbin extension for Hertz.☆11Feb 20, 2023Updated 3 years ago
- Limited automatic tabular ML pipelines for generic MEDS datasets.☆18Mar 13, 2026Updated last month
- Code on paper: Eraser: Eliminating Performance Regression on Learned Query Optimizer☆12Nov 10, 2023Updated 2 years ago
- ☆11Nov 12, 2020Updated 5 years ago
- PilotScope is a middleware to bridge the gaps of deploying AI4DB (Artificial Intelligence for Databases) algorithms into actual database …☆168Oct 12, 2024Updated last year
- Implementation of our VLDB'22 paper "Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction"☆54Nov 11, 2022Updated 3 years ago