zheng-yanan / techniques-for-kl-vanishing
This repository summarizes techniques for KL divergence vanishing problem.
☆30Updated 5 years ago
Alternatives and similar repositories for techniques-for-kl-vanishing
Users that are interested in techniques-for-kl-vanishing are comparing it to the libraries listed below
Sorting:
- pytorch implementation of VAE-Gumble-Softmax☆63Updated 4 years ago
- Neural State Machine implemented in PyTorch☆71Updated 5 years ago
- Variational Transformers for Diverse Response Generation☆81Updated 9 months ago
- Project page for paper Self-supervised Representation Learning with Relative Predictive Coding☆17Updated 3 years ago
- implements optimal transport algorithms in pytorch☆97Updated 3 years ago
- Transformer-Based Conditioned Variational Autoencoder for Story Completion☆94Updated 4 years ago
- Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation☆69Updated 4 years ago
- Weighted Training for Cross-Task Learning☆15Updated 2 years ago
- ☆12Updated 3 years ago
- Hyperbolic Neural Networks, pytorch☆86Updated 5 years ago
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆65Updated 2 years ago
- ☆36Updated 4 years ago
- ☆24Updated 3 years ago
- ☆27Updated 5 years ago
- This repository hosts the dataset and source code for "A causal view of compositional zero-shot recognition". Yuval Atzmon, Felix Kreuk, …☆27Updated 3 years ago
- ☆45Updated 4 years ago
- ☆36Updated 4 years ago
- Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019)☆28Updated 6 years ago
- Tensorflow implementation of Invariant Rationalization☆49Updated 2 years ago
- Variational Autoencoder with Spatial Broadcast Decoder☆35Updated 5 years ago
- [ICLR 2020] FSPool: Learning Set Representations with Featurewise Sort Pooling☆42Updated last year
- A Pytorch implementation of the optimal transport kernel embedding☆116Updated 4 years ago
- Mixed-curvature Variational Autoencoders (ICLR 2020)☆62Updated 4 years ago
- Transformer Variational Autoencoder experiment☆49Updated 6 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆45Updated 2 years ago
- Code to reproduce the results for Compositional Attention☆60Updated 2 years ago
- MoCo with Alignment and Uniformity Loss.☆62Updated 3 years ago
- [ICML2023] InfoOT: Information Maximizing Optimal Transport☆40Updated 2 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Updated 5 years ago
- Code for the paper Physics-as-Inverse-Graphics: Joint Unsupervised Learning of Objects and Physics from Video☆41Updated last year