Code for "Mixed Cross Entropy Loss for Neural Machine Translation"
☆20Jul 23, 2021Updated 4 years ago
Alternatives and similar repositories for mix
Users that are interested in mix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- statnlp-neural☆32Sep 26, 2019Updated 6 years ago
- codes for "Scheduled Sampling Based on Decoding Steps for Neural Machine Translation" (long paper of EMNLP-2022)☆20Aug 31, 2021Updated 4 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆16Jan 8, 2022Updated 4 years ago
- Differentiable Perturb-and-Parse operator☆25Mar 7, 2019Updated 7 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago
- Official implementation of "Learning Proposals for Practical Energy-Based Regression", AISTATS 2022.☆13Feb 4, 2023Updated 3 years ago
- ☆16Mar 25, 2022Updated 4 years ago
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"☆11May 5, 2021Updated 5 years ago
- ☆63Mar 20, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fast and Modularized CFG-focused Models☆23Nov 8, 2023Updated 2 years ago
- ☆14Apr 15, 2023Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- Language model with phrase induction☆14Jun 13, 2019Updated 6 years ago
- ☆13Aug 27, 2021Updated 4 years ago
- source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conferenc…☆52Mar 28, 2025Updated last year
- The Code & Paper for ACL 2023 paper "Enhancing Language Representation with Constructional Information for Natural Language Understanding…☆20Jan 18, 2025Updated last year
- Blog post☆17Feb 16, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for paper: End-to-end Stochastic Optimization with Energy-based Model☆16Feb 14, 2023Updated 3 years ago
- Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks"☆24Jan 14, 2020Updated 6 years ago
- PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656☆25Jun 12, 2023Updated 2 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- Implementation of Dual Learning NMT & Joint Training on tensorflow☆12Dec 29, 2018Updated 7 years ago
- Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)☆10Jun 13, 2019Updated 6 years ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Mar 3, 2025Updated last year
- Source code for "Importance-based Neuron Allocation for Multilingual Neural Machine Translation"☆12Sep 15, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆23Jul 23, 2021Updated 4 years ago
- Learning Latent Forests for Medical Relation Extraction (authors' PyTorch implementation for the IJCAI20 paper)☆24Oct 31, 2020Updated 5 years ago
- ☆57Jun 3, 2022Updated 3 years ago
- Dynamic data selection for neural machine translation☆20Jan 28, 2018Updated 8 years ago
- Implementation for Jacobian Adversarially Regularized Networks for Robustness (ICLR 2020)☆22Dec 30, 2019Updated 6 years ago
- Ying Nian Wu's UCLA Statistical Machine Learning Tutorial on generative modeling.☆63Jan 7, 2023Updated 3 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆33Sep 15, 2021Updated 4 years ago