Olliang / Statistical-Similarity-Measurement
A methodology designed to validate the statistical similarity of synthetic data generated by GAN models. The metrics include an auto-encoder, PCA, t-SNE, KL divergence, clustering, and cosine similarity.
☆11 Updated 4 years ago
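Two of the metrics named in the description, KL divergence and cosine similarity, can be illustrated with a minimal sketch. This is not the repository's implementation: the function names (`histogram`, `kl_divergence`, `cosine_similarity`), the fixed-width binning, and the `eps` smoothing are all assumptions made for illustration, and the sample values are made up.

```python
import math


def histogram(values, bins, lo, hi):
    """Return normalized bin frequencies over [lo, hi] (hypothetical helper; assumes hi > lo)."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        # Clamp so that out-of-range values fall into the first or last bin.
        idx = max(0, min(int((v - lo) / width), bins - 1))
        counts[idx] += 1
    total = len(values)
    return [c / total for c in counts]


def kl_divergence(p, q, eps=1e-10):
    """KL(p || q) for two discrete distributions; eps (an assumption) avoids log(0)."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Illustrative values for one numeric column of a real and a synthetic table.
real = [0.1, 0.2, 0.4, 0.5, 0.9]
synthetic = [0.15, 0.25, 0.35, 0.6, 0.8]

p = histogram(real, bins=4, lo=0.0, hi=1.0)
q = histogram(synthetic, bins=4, lo=0.0, hi=1.0)

print("KL(real || synthetic):", kl_divergence(p, q))
print("cosine similarity:", cosine_similarity(p, q))
```

A KL divergence near 0 and a cosine similarity near 1 would suggest that the two column distributions are close under this binning; the result depends on the number of bins chosen.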
Alternatives and similar repositories for Statistical-Similarity-Measurement:
Users who are interested in Statistical-Similarity-Measurement are comparing it to the libraries listed below:
- [Python] Additional works on Edward Choi's medGAN (generative adversarial network for electronic health records). In particular: boosting… ☆24 Updated 2 years ago
- COR-GAN: Correlation-Capturing Convolutional Neural Networks for Generating Synthetic Healthcare Records ☆56 Updated 4 years ago
- Pacmed Labs experiments on uncertainty estimation, focusing on unbalanced tabular data and classification tasks. ☆21 Updated 3 years ago
- Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANs ☆44 Updated last year
- ☆23 Updated this week
- Counterfactual SHAP: a framework for counterfactual feature importance ☆18 Updated last year
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metrics ☆45 Updated 2 years ago
- Source code of ME2Vec. ☆14 Updated last year
- Multi-class probabilistic classification using inductive and cross Venn–Abers predictors ☆44 Updated 2 years ago
- An interpretable kNN based on aggregating the predictions of multiple 2d spaces. ☆13 Updated 6 months ago
- Feature Selection using Simulated Annealing ☆11 Updated 2 years ago
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package. ☆17 Updated last year
- nbsynthetic is a simple and robust tabular synthetic data generation library for small and medium-sized datasets ☆65 Updated 2 years ago
- Helpers for scikit learn ☆16 Updated 2 years ago
- Perform inference on algorithm-agnostic variable importance in Python ☆20 Updated 2 years ago
- Repository for code release of paper "Robust Variational Autoencoders for Outlier Detection and Repair of Mixed-Type Data" (AISTATS 2020) ☆50 Updated 5 years ago
- Evaluate real and synthetic datasets against each other ☆86 Updated 3 months ago
- Phe2vec: Automated Disease Phenotyping based on Unsupervised Embeddings from Electronic Health Records ☆24 Updated 4 years ago
- Deep Kernel Survival Analysis and Subject-Specific Survival Time Prediction Intervals ☆13 Updated 3 years ago
- ☆18 Updated last year
- A Natural Language Interface to Explainable Boosting Machines ☆66 Updated 9 months ago
- Ensemble-based, size-agnostic wrapper for the TabPFN classifier ☆31 Updated 11 months ago
- [Python] Comparison of empirical probability distributions. Integral probability metrics (e.g. Kantorovich metric). f-divergences (e.g. K… ☆11 Updated 2 years ago
- Standalone Neural Additive Models, forked from google-research for easy import to Colab ☆28 Updated 4 years ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text" ☆12 Updated 5 years ago
- GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Data… ☆70 Updated 2 years ago
- Resources for Machine Learning Explainability ☆76 Updated 7 months ago
- [Experimental] Causal graphs that are networkx-compliant for the py-why ecosystem. ☆55 Updated this week
- ☆10 Updated 4 years ago
- A benchmark to evaluate popular CASH and AutoML frameworks ☆16 Updated last year