A novel approach for synthesizing tabular data using pretrained large language models
☆351Feb 9, 2026Updated last month
Alternatives and similar repositories for be_great
Users that are interested in be_great are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆244Jan 4, 2026Updated 2 months ago
- Official git for "TabuLa: Harnessing Language Models for Tabular Data Synthesis"☆45Jul 30, 2025Updated 7 months ago
- Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""☆194Jul 15, 2024Updated last year
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆537Jul 13, 2024Updated last year
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆645Feb 11, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official code for "STaSy: Score-based Tabular data Synthesis", ICLR 2023☆35Aug 11, 2023Updated 2 years ago
- ☆26Mar 9, 2023Updated 3 years ago
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"☆95Jan 19, 2024Updated 2 years ago
- Official GitHub for CTAB-GAN+☆85May 14, 2024Updated last year
- Benchmarking synthetic data generation methods.☆304Updated this week
- Experiments on Tabular Data Models☆279May 25, 2023Updated 2 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆256Mar 19, 2026Updated last week
- Official code for "CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis", ICML 2023☆37Jan 9, 2024Updated 2 years ago
- A modular Python framework for standardized evaluation and benchmarking of online learning models.☆10Nov 24, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A framework for prototyping and benchmarking imputation methods☆196Apr 4, 2023Updated 2 years ago
- Software for evaluating the quality of synthetic data compared with real data.☆36Feb 13, 2026Updated last month
- Conditional GAN for generating synthetic tabular data.☆1,536Mar 16, 2026Updated last week
- The code for the paper "MediTab: Scaling Medical Tabular Data Predictors via Data Consolidation, Enrichment, and Refinement"☆23May 8, 2024Updated last year
- Code and data for "TURL: Table Understanding through Representation Learning"☆136Nov 23, 2025Updated 4 months ago
- A repo for transfer learning with deep tabular models☆104Feb 15, 2023Updated 3 years ago
- ☆341Nov 2, 2023Updated 2 years ago
- Instance-based uncertainty estimation for gradient-boosted regression trees☆32Jul 25, 2024Updated last year
- Resources for PVLDB 2023 submission☆27Aug 28, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Synthetic data generation for tabular data☆3,451Updated this week
- Code for Transformed Distribution Matching (TDM) for Missing Value Imputation, ICML 2023☆14Aug 4, 2023Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- [ICLR 2024 spotlight] Making Pre-trained Language Models Great on Tabular Prediction☆68Jul 12, 2024Updated last year
- We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review …☆565Mar 7, 2026Updated 2 weeks ago
- Implementation of the paper: "FedTabDiff: Federated Learning of Diffusion Models for Synthetic Mixed-Type Tabular Data Generation"☆23Nov 10, 2024Updated last year
- ⚡ TabPFN: Foundation Model for Tabular Data ⚡☆5,890Updated this week
- A library of Reversible Data Transforms☆133Feb 23, 2026Updated last month
- NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables☆213Mar 13, 2025Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Code for paper: Are Large Language Models Post Hoc Explainers?☆34Jul 22, 2024Updated last year
- Dual Adversarial Autoencoder for Generating Set-valued Sequences☆19Jan 15, 2021Updated 5 years ago
- git for paper "CTAB-GAN: Effective Table Data Synthesizing"☆13Apr 25, 2022Updated 3 years ago
- ☆24Sep 16, 2022Updated 3 years ago
- Private Evolution: Generating DP Synthetic Data without Training [ICLR 2024, ICML 2024 Spotlight]☆110Nov 8, 2025Updated 4 months ago
- ☆15May 19, 2025Updated 10 months ago
- Implementation of SANTOS: Relationship-based Semantic Table Union Search.☆13Nov 21, 2023Updated 2 years ago