Code for "Understanding and Improving Layer Normalization"
☆46Dec 8, 2019Updated 6 years ago
Alternatives and similar repositories for AdaNorm
Users that are interested in AdaNorm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆86Jul 24, 2023Updated 2 years ago
- The dataset and code for the paper "Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information"☆19Oct 28, 2019Updated 6 years ago
- The codes and data for paper "Learning to Control the Fine-grained Sentiment for Story Ending Generation (ACL 2019)".☆26Sep 26, 2019Updated 6 years ago
- ☆16May 5, 2022Updated 3 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆30Mar 17, 2020Updated 6 years ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)☆42Aug 19, 2019Updated 6 years ago
- Visual Storytelling post-edit dataset☆18Sep 27, 2019Updated 6 years ago
- A pakage for crawling audio from Youtube☆42Aug 8, 2023Updated 2 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Oct 2, 2020Updated 5 years ago
- Example code for solving the Titanic prediction problem found on Kaggle.☆25Apr 30, 2013Updated 12 years ago
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).☆12May 14, 2020Updated 5 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Jun 20, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Learn models that are robust to spurious correlations in the dataset.☆26Dec 31, 2019Updated 6 years ago
- ☆24Jan 30, 2020Updated 6 years ago
- Code for the paper at AAAI2019☆59Aug 19, 2019Updated 6 years ago
- Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text (EMNLP2018)☆147Sep 15, 2018Updated 7 years ago
- Transformer, Evolved Transformer Model☆10Jul 6, 2019Updated 6 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 4 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15May 30, 2021Updated 4 years ago
- some tutorials for blog: simonjisu.github.io☆23Mar 25, 2021Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code for the paper☆11May 24, 2024Updated last year
- Learning DTW-Preserving Shapelets☆25Oct 5, 2023Updated 2 years ago
- An implementation of AutoScale regression-based method☆12Oct 27, 2020Updated 5 years ago
- ☆26Dec 19, 2018Updated 7 years ago
- Pre-trained model, code, and materials from the paper "Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmen…☆60Jul 6, 2020Updated 5 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated this week
- Implementation of Dual Learning NMT & Joint Training on tensorflow☆12Dec 29, 2018Updated 7 years ago
- Multi-modal data augmentation for machine learning☆16Jun 4, 2019Updated 6 years ago
- Zero -- A neural machine translation system☆152May 8, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Jun 19, 2021Updated 4 years ago
- Tool for Evaluating Adversarial Perturbations on Text☆61Feb 27, 2022Updated 4 years ago
- Android video semantic segmentation using DeeplabV3+ lite☆10Sep 20, 2019Updated 6 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- Aligned Diffusion Schroedinger Bridges (UAI 2023)☆33Oct 1, 2023Updated 2 years ago
- (TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf…☆10Sep 6, 2021Updated 4 years ago
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation"☆24Mar 1, 2020Updated 6 years ago