Code for "Understanding and Improving Layer Normalization"
☆46Dec 8, 2019Updated 6 years ago
Alternatives and similar repositories for AdaNorm
Users that are interested in AdaNorm are comparing it to the libraries listed below
Sorting:
- The dataset and code for the paper "Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information"☆19Oct 28, 2019Updated 6 years ago
- The codes and data for paper "Learning to Control the Fine-grained Sentiment for Story Ending Generation (ACL 2019)".☆26Sep 26, 2019Updated 6 years ago
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)☆14Apr 16, 2019Updated 6 years ago
- The QA datasets used for DrQA evaluation.☆14Nov 30, 2018Updated 7 years ago
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).☆12May 14, 2020Updated 5 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.☆86Jul 24, 2023Updated 2 years ago
- Efficient Segmentation: Learning Downsampling Near Semantic Boundaries☆32Sep 29, 2020Updated 5 years ago
- An example docker container for runtime evaluation for the WIDER 2019 challenge track: face detection accuracy and runtime.☆17Aug 7, 2019Updated 6 years ago
- ☆15May 23, 2022Updated 3 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- Multi-modal data augmentation for machine learning☆16Jun 4, 2019Updated 6 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 4 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 2 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆30Mar 17, 2020Updated 5 years ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- Data and code for paper "Review-Driven Multi-Label Music Style Classification by Exploiting Style Correlations"☆17Jun 30, 2019Updated 6 years ago
- Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)☆42Aug 19, 2019Updated 6 years ago
- Dynamic data selection for neural machine translation☆20Jan 28, 2018Updated 8 years ago
- Codes for paper "LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification"☆16Oct 30, 2019Updated 6 years ago
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation"☆24Mar 1, 2020Updated 6 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Jun 20, 2021Updated 4 years ago
- ☆24Nov 21, 2020Updated 5 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- ☆18Jul 30, 2018Updated 7 years ago
- Word sense disambiguation using contextualized word embedding☆17Dec 18, 2019Updated 6 years ago
- Example code for solving the Titanic prediction problem found on Kaggle.☆25Apr 30, 2013Updated 12 years ago
- ☆24Jan 30, 2020Updated 6 years ago
- Neural Machine Translation with universal Visual Representation (ICLR 2020)☆90Jul 1, 2020Updated 5 years ago
- implementation EDANet by pytorch☆21Sep 30, 2018Updated 7 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text (EMNLP2018)☆147Sep 15, 2018Updated 7 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Oct 27, 2022Updated 3 years ago
- The Importance of Being Recurrent for Modeling Hierarchical Structure☆25Jun 27, 2018Updated 7 years ago
- This is the code for "Learning Sentiment Memories for Sentiment Modification without Parallel Data".☆55Dec 18, 2018Updated 7 years ago
- Code for SelfAugment☆27Dec 16, 2020Updated 5 years ago
- ☆26Dec 19, 2018Updated 7 years ago
- Embedding-based Scalable Segmentation Network☆28Oct 15, 2022Updated 3 years ago
- code for Explicit Sparse Transformer☆61Jul 21, 2023Updated 2 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆34May 16, 2023Updated 2 years ago