Code for "Understanding and Improving Layer Normalization"
☆46Dec 8, 2019Updated 6 years ago
Alternatives and similar repositories for AdaNorm
Users that are interested in AdaNorm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆86Jul 24, 2023Updated 2 years ago
- The dataset and code for the paper "Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information"☆19Oct 28, 2019Updated 6 years ago
- The codes and data for paper "Learning to Control the Fine-grained Sentiment for Story Ending Generation (ACL 2019)".☆26Sep 26, 2019Updated 6 years ago
- Code for the ACL'18 paper: A Neural Approach to Pun Generation☆18Jan 13, 2020Updated 6 years ago
- The QA datasets used for DrQA evaluation.☆14Nov 30, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)☆14Apr 16, 2019Updated 7 years ago
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Jan 9, 2024Updated 2 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆30Mar 17, 2020Updated 6 years ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)☆42Aug 19, 2019Updated 6 years ago
- Code for GSN: A Graph-Structured Network for Multi-Party Dialogues☆30Aug 10, 2019Updated 6 years ago
- Codes for paper "LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification"☆16Oct 30, 2019Updated 6 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Oct 2, 2020Updated 5 years ago
- Example code for solving the Titanic prediction problem found on Kaggle.☆25Apr 30, 2013Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).☆12May 14, 2020Updated 5 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Jun 20, 2021Updated 4 years ago
- Learn models that are robust to spurious correlations in the dataset.☆26Dec 31, 2019Updated 6 years ago
- ☆24Jan 30, 2020Updated 6 years ago
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- Code for the paper at AAAI2019☆59Aug 19, 2019Updated 6 years ago
- Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text (EMNLP2018)☆147Sep 15, 2018Updated 7 years ago
- Code for Overcoming Catastrophic Forgetting for Continual Learning via Model Adaptation☆21May 7, 2019Updated 6 years ago
- Code for "A Skeleton-Based Model for Promoting Coherence Among Sentences in Narrative Story Generation" (EMNLP 2018)☆63Aug 21, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 4 years ago
- some tutorials for blog: simonjisu.github.io☆23Mar 25, 2021Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15May 30, 2021Updated 4 years ago
- Embedding-based Scalable Segmentation Network☆28Oct 15, 2022Updated 3 years ago
- Code for the paper☆11May 24, 2024Updated last year
- ☆10Nov 27, 2023Updated 2 years ago
- An example docker container for runtime evaluation for the WIDER 2019 challenge track: face detection accuracy and runtime.☆17Aug 7, 2019Updated 6 years ago
- Neural Machine Translation with universal Visual Representation (ICLR 2020)☆91Jul 1, 2020Updated 5 years ago
- Implementation of Dual Learning NMT & Joint Training on tensorflow☆12Dec 29, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Lateral Inhibition-Inspired Convolutional Neural Network for Visual Attention and Saliency Detection☆13Nov 6, 2020Updated 5 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Jun 19, 2021Updated 4 years ago
- Tool for Evaluating Adversarial Perturbations on Text☆61Feb 27, 2022Updated 4 years ago
- The baseline model code for WMT 2021 Triangular MT☆13Apr 7, 2021Updated 5 years ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆13Jan 17, 2023Updated 3 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Oct 27, 2022Updated 3 years ago
- Mirror of nofib repository. DO NOT SUBMIT PULL REQUESTS HERE☆14Sep 19, 2019Updated 6 years ago