Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"
☆77Jun 12, 2021Updated 5 years ago
Alternatives and similar repositories for seq2seq-con
Users that are interested in seq2seq-con are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving cross-lingual word embeddings by meeting in the middle☆23Aug 25, 2020Updated 5 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Mar 16, 2022Updated 4 years ago
- A tool for holistic analysis of language generations systems☆471Sep 22, 2025Updated 9 months ago
- ☆28Nov 28, 2021Updated 4 years ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Jun 17, 2016Updated 10 years ago
- ☆12Aug 31, 2021Updated 4 years ago
- Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning"☆35Nov 8, 2024Updated last year
- ☆12Sep 1, 2021Updated 4 years ago
- Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"☆172Dec 12, 2018Updated 7 years ago
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 6 years ago
- a ducttape workflow for neural machine translation☆14Mar 23, 2021Updated 5 years ago
- ☆20Dec 18, 2022Updated 3 years ago
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding"☆29Jun 2, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Norm-Based Curriculum Learning for Neural Machine Translation (ACL 2020)☆18Aug 1, 2020Updated 5 years ago
- ☆20Jun 14, 2019Updated 7 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- ☆10Jun 11, 2019Updated 7 years ago
- FEVER Workshop Shared-Task☆16Apr 16, 2019Updated 7 years ago
- PyTorch implementation of "Lagging Inference Networks and Posterior Collapse in Variational Autoencoders" (ICLR 2019)☆185Dec 15, 2020Updated 5 years ago
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Jul 13, 2021Updated 4 years ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆18Oct 18, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementation of Proximal Gradient Algorithms a la Parikh and Boyd (2014). Useful for Auto-Sizing (Murray and Chiang 2015, Murra…☆41Sep 20, 2020Updated 5 years ago
- A curated list of resources on energy-based models.☆11Mar 14, 2022Updated 4 years ago
- ☆54Oct 24, 2024Updated last year
- ☆11Mar 17, 2026Updated 3 months ago
- Non-Monotonic Sequential Text Generation (ICML 2019)☆72May 2, 2019Updated 7 years ago
- A tool for converting TMX files into bilingual corpora☆19Feb 4, 2020Updated 6 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆94Aug 29, 2018Updated 7 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- ☆36Oct 20, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Aug 29, 2023Updated 2 years ago
- Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"☆94May 2, 2018Updated 8 years ago
- JavaScript virtual machine for WebAssembly and Taint Tracking security tool☆15Dec 15, 2017Updated 8 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆165Apr 13, 2026Updated 2 months ago
- Variational inference for Bayesian logistic regression☆13Aug 30, 2019Updated 6 years ago
- ☆24Jan 20, 2021Updated 5 years ago
- ☆16Jul 6, 2023Updated 2 years ago