zheng-yanan / techniques-for-kl-vanishingLinks
This repository summarizes techniques for KL divergence vanishing problem.
☆30Updated 6 years ago
Alternatives and similar repositories for techniques-for-kl-vanishing
Users that are interested in techniques-for-kl-vanishing are comparing it to the libraries listed below
Sorting:
- ☆198Updated 3 years ago
- This repo is for our paper "ControlVAE: Controllable Variational Autoencoder" published at ICML 2020. It can be used for text generation,…☆96Updated 2 years ago
- Transformer Variational Autoencoder experiment☆50Updated 6 years ago
- Hyperbolic Neural Networks, pytorch☆87Updated 6 years ago
- A Pytorch implementation of the optimal transport kernel embedding☆119Updated 4 years ago
- A curated list of techniques to avoid posterior collapse☆86Updated 2 years ago
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆67Updated 3 years ago
- Transformer-Based Conditioned Variational Autoencoder for Story Completion☆95Updated 5 years ago
- Variational Transformers for Diverse Response Generation☆82Updated last year
- pytorch implementation of VAE-Gumble-Softmax☆63Updated 5 years ago
- Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation☆69Updated 5 years ago
- Transformer-based Conditional Variational Autoencoder for Controllable Story Generation☆162Updated 3 years ago
- Neural State Machine implemented in PyTorch☆71Updated 6 years ago
- A σ-VAE implementation in PyTorch☆101Updated 4 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75Updated 2 years ago
- PyTorch implementation of a Variational Autoencoder with Gumbel-Softmax Distribution☆213Updated 7 years ago
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆56Updated 4 years ago
- ☆24Updated 4 years ago
- PyTorch implementation of "Lagging Inference Networks and Posterior Collapse in Variational Autoencoders" (ICLR 2019)☆185Updated 5 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆19Updated 4 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Updated 5 years ago
- ☆36Updated 5 years ago
- ☆43Updated 3 years ago
- Implementation of Sparsemax activation in Pytorch☆166Updated 5 years ago
- ☆33Updated 6 years ago
- PyTorch implementation of NICE☆34Updated 6 years ago
- Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder…☆17Updated 5 years ago
- Code to reproduce the results for Compositional Attention☆59Updated 3 years ago
- [ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.☆170Updated 4 years ago
- implements optimal transport algorithms in pytorch☆104Updated 3 years ago