zheng-yanan / techniques-for-kl-vanishing
This repository summarizes techniques for KL divergence vanishing problem.
☆28Updated 4 years ago
Related projects: ⓘ
- Transformer Variational Autoencoder experiment☆49Updated 5 years ago
- pytorch implementation of VAE-Gumble-Softmax☆61Updated 4 years ago
- Variational Transformers for Diverse Response Generation☆82Updated last month
- A Pytorch implementation of the optimal transport kernel embedding☆111Updated 3 years ago
- Neural State Machine implemented in PyTorch☆70Updated 4 years ago
- Code for reversible recurrent neural networks☆38Updated 5 years ago
- ☆11Updated 3 years ago
- Transformer-Based Conditioned Variational Autoencoder for Story Completion☆94Updated 4 years ago
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar…☆24Updated 6 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition☆89Updated 6 months ago
- Code for the paper PermuteFormer☆43Updated 2 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75Updated last year
- This is a PyTorch implementation of the ICLR 2017 paper "HIERARCHICAL MULTISCALE RECURRENT NEURAL NETWORKS" (https://openreview.net/pdf?i…☆50Updated 6 years ago
- [ICLR 2020] FSPool: Learning Set Representations with Featurewise Sort Pooling☆42Updated 11 months ago
- Dataset and documentation for paper on explaining solutions to physical reasoning tasks (ESPRIT))☆21Updated 2 years ago
- [ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?"☆91Updated last year
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆63Updated 2 years ago
- Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation☆69Updated 3 years ago
- Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…☆62Updated 4 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆26Updated 3 years ago
- code for paper "Improving Sequence-to-Sequence Learning via Optimal Transport"☆69Updated 5 years ago
- A PyTorch implementation of Compositional Attention Networks☆23Updated 6 years ago
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆54Updated 2 years ago
- ☆180Updated last year
- Systematic generalization test for CLEVR☆15Updated 4 years ago
- PyTorch implementation for The Scattering Compositional Learner (SCL)☆32Updated 4 years ago
- ☆63Updated 2 years ago
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…☆13Updated last year
- Code for Reparameterizable Subset Sampling via Continuous Relaxations, IJCAI 2019.☆49Updated 11 months ago