Expand your Training Limits! Generating Training Data for ML-based Data Management
☆16Jul 12, 2022Updated 3 years ago
Alternatives and similar repositories for data-farm
Users that are interested in data-farm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Randomized Dependence Coefficient in Python☆20May 12, 2019Updated 7 years ago
- Balsa is a learned SQL query optimizer. It tailor optimizes your SQL queries to find the best execution plans for your hardware and engin…☆148Jun 13, 2022Updated 4 years ago
- Scalytics Connect development environment, pre-build☆22Feb 21, 2024Updated 2 years ago
- PostBOUND is a research framework to prototype and benchmark database query optimizers☆26Jun 25, 2026Updated last week
- A python API for exposing and processing RDF data from sparql endpoints for data mining and machine learning models in convenient formats…☆19Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Artifact evaluation for "E2Usd: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series" accepted by WWW'24☆14Jul 29, 2024Updated last year
- Paper repository for "SWIRL: Selection of Workload-aware Indexes using Reinforcement Learning" (EDBT 2022)☆41Jul 12, 2025Updated 11 months ago
- ☆20Apr 11, 2022Updated 4 years ago
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆76Nov 8, 2024Updated last year
- A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Explora…☆31Jul 15, 2025Updated 11 months ago
- Git相关资料☆11Apr 4, 2019Updated 7 years ago
- This project is a unified ETL platform that support various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Sh…☆17Oct 16, 2015Updated 10 years ago
- This is the source code of the SIGMOD paper: "How Good are Learned Cost Models, Really? Insights From Query Optimization Tasks"☆29Jan 21, 2026Updated 5 months ago
- Factorize Sum Product Network☆30Nov 16, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆74Jan 20, 2023Updated 3 years ago
- This repository includes the code base used in the paper "Is Your Learned Query Optimizer Behaving As You Expect? A Machine Learning Pers…☆20Feb 22, 2024Updated 2 years ago
- Query-based Workload Forecasting for Self-Driving DBMS☆102Oct 8, 2022Updated 3 years ago
- A comprehensive jmh-based benchmark on various RTree configurations for Dave Moten's rtree implementation.☆11Mar 24, 2017Updated 9 years ago
- ☆39Jul 6, 2023Updated 2 years ago
- A prototype implementation of Bao for PostgreSQL☆221Sep 17, 2024Updated last year
- PostgreSQL extension for recoding workload of specific database☆12Mar 21, 2016Updated 10 years ago
- Cardinality Estimation via Learned Dynamic Sample Selection☆10Jan 8, 2024Updated 2 years ago
- ☆10Feb 12, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…☆14Aug 15, 2021Updated 4 years ago
- R* Tree java implementation☆12Sep 1, 2016Updated 9 years ago
- A Study of Database Performance Sensitivity to Experiment Settings☆11May 31, 2022Updated 4 years ago
- ☆14Apr 24, 2023Updated 3 years ago
- 截取iPad上京东阅读的书籍,自动保存为pdf☆11Nov 27, 2018Updated 7 years ago
- ☆36Feb 11, 2025Updated last year
- PyTorch implementation of binary tree convolution☆50Jan 3, 2020Updated 6 years ago
- Scalable Graph Mining☆64Nov 13, 2022Updated 3 years ago
- ☆64May 12, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o…☆54Mar 3, 2024Updated 2 years ago
- ☆11Nov 2, 2022Updated 3 years ago
- A straightforward implementation of EGBM-based Generalized Additive Model☆13Oct 15, 2020Updated 5 years ago
- The casbin extension for Hertz.☆11Feb 20, 2023Updated 3 years ago
- Code on paper: Eraser: Eliminating Performance Regression on Learned Query Optimizer☆12Nov 10, 2023Updated 2 years ago
- ☆11Nov 12, 2020Updated 5 years ago
- PilotScope is a middleware to bridge the gaps of deploying AI4DB (Artificial Intelligence for Databases) algorithms into actual database …☆168Oct 12, 2024Updated last year