A novel approach for synthesizing tabular data using pretrained large language models
☆352 Feb 9, 2026 Updated 2 months ago
Alternatives and similar repositories for be_great
Users that are interested in be_great are comparing it to the libraries listed below.
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation. ☆244 Jan 4, 2026 Updated 3 months ago
- ☆68 May 23, 2023 Updated 2 years ago
- Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space" ☆195 Jul 15, 2024 Updated last year
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models" ☆542 Jul 13, 2024 Updated last year
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation. ☆651 Feb 11, 2026 Updated 2 months ago
- ☆26 Mar 9, 2023 Updated 3 years ago
- Official GitHub for CTAB-GAN+ ☆85 May 14, 2024 Updated last year
- Benchmarking synthetic data generation methods. ☆305 Updated this week
- Experiments on Tabular Data Models ☆279 May 25, 2023 Updated 2 years ago
- PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021) ☆15 Oct 10, 2022 Updated 3 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets. ☆258 Apr 6, 2026 Updated last week
- Official code for "CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis", ICML 2023 ☆37 Jan 9, 2024 Updated 2 years ago
- A modular Python framework for standardized evaluation and benchmarking of online learning models. ☆10 Nov 24, 2022 Updated 3 years ago
- A framework for prototyping and benchmarking imputation methods ☆199 Apr 4, 2023 Updated 3 years ago
- Evaluate real and synthetic datasets against each other ☆92 Updated this week
- Software for evaluating the quality of synthetic data compared with real data. ☆37 Apr 8, 2026 Updated last week
- Conditional GAN for generating synthetic tabular data. ☆1,544 Apr 7, 2026 Updated last week
- tableGAN is a synthetic data generation technique (Data Synthesis based on Generative Adversarial Networks paper) based on Generative Ad… ☆154 May 1, 2019 Updated 6 years ago
- Code and data for "TURL: Table Understanding through Representation Learning" ☆136 Nov 23, 2025 Updated 4 months ago
- Gaussian Membership Inference Privacy (NeurIPS 2023) ☆12 Jul 27, 2024 Updated last year
- ☆341 Nov 2, 2023 Updated 2 years ago
- Resources for PVLDB 2023 submission ☆27 Aug 28, 2024 Updated last year
- Instance-based uncertainty estimation for gradient-boosted regression trees ☆32 Jul 25, 2024 Updated last year
- Source Code of the ROAD benchmark for feature attribution methods (ICML22) ☆24 Jun 26, 2023 Updated 2 years ago
- Synthetic data generation for tabular data ☆3,467 Updated this week
- Code for Transformed Distribution Matching (TDM) for Missing Value Imputation, ICML 2023 ☆14 Aug 4, 2023 Updated 2 years ago
- [ICLR 2024 spotlight] Making Pre-trained Language Models Great on Tabular Prediction ☆68 Jul 12, 2024 Updated last year
- Implementation of the paper: "FedTabDiff: Federated Learning of Diffusion Models for Synthetic Mixed-Type Tabular Data Generation" ☆23 Nov 10, 2024 Updated last year
- ⚡ TabPFN: Foundation Model for Tabular Data ⚡ ☆6,041 Updated this week
- A library of Reversible Data Transforms ☆133 Updated this week
- Code for paper: Are Large Language Models Post Hoc Explainers? ☆34 Jul 22, 2024 Updated last year
- NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables ☆215 Mar 13, 2025 Updated last year
- git for paper "CTAB-GAN: Effective Table Data Synthesizing" ☆13 Apr 25, 2022 Updated 3 years ago
- ☆24 Sep 16, 2022 Updated 3 years ago
- This is the official codebase of "Exploring Generative Neural Temporal Point Process" (Accepted by TMLR). ☆21 May 22, 2023 Updated 2 years ago
- Private Evolution: Generating DP Synthetic Data without Training [ICLR 2024, ICML 2024 Spotlight] ☆111 Nov 8, 2025 Updated 5 months ago
- ☆15 May 19, 2025 Updated 10 months ago
- Implementation of SANTOS: Relationship-based Semantic Table Union Search. ☆13 Nov 21, 2023 Updated 2 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ☆675 Jun 24, 2025 Updated 9 months ago