MAsked Sequence to Sequence (MASS) pre-training for language generation
☆20 · Mar 18, 2019 · Updated 7 years ago
Alternatives and similar repositories for MASS
Users that are interested in MASS are comparing it to the libraries listed below.
- PyTorch attentional NMT (with NLL, MRT, REINFORCE, MIXER training objectives) ☆13 · May 12, 2017 · Updated 9 years ago
- Experiments with AllenNLP on semantic parsing datasets ☆17 · Dec 29, 2018 · Updated 7 years ago
- Analyzing Uncertainty in Neural Machine Translation ☆35 · Sep 15, 2021 · Updated 4 years ago
- Open-book question answering ☆15 · Dec 8, 2022 · Updated 3 years ago
- Code for AAAI-2020 oral paper: RefNet: A Reference-aware Network for Background Based Conversation ☆25 · Nov 24, 2019 · Updated 6 years ago
- Just-in-time Dynamic Batching with MXNet Gluon. ☆52 · May 18, 2020 · Updated 6 years ago
- PyTorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671 ☆19 · May 6, 2021 · Updated 5 years ago
- Deep learning study in Gluon 2nd edition ☆24 · Mar 6, 2019 · Updated 7 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (PyTorch implementation for class imbalance). ☆35 · Sep 15, 2019 · Updated 6 years ago
- Triton version of GQA flash attention, based on the tutorial ☆12 · Aug 4, 2024 · Updated last year
- English-French MT dialogue dataset ☆17 · Apr 29, 2022 · Updated 4 years ago
- Consistent dialogue generation ☆16 · Oct 26, 2022 · Updated 3 years ago
- seq2seq with attention in MXNet ☆18 · Oct 13, 2017 · Updated 8 years ago
- ☆10 · Oct 15, 2020 · Updated 5 years ago
- Dockerfile and instructions for human pose estimation implementation using Caffe, OpenCV 3.1.0 and Python 2.7. ☆12 · Mar 3, 2019 · Updated 7 years ago
- A Pointer Generator with a BERT encoder ☆10 · Aug 12, 2019 · Updated 6 years ago
- A trained attention-based summarization model ☆10 · May 22, 2017 · Updated 9 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation ☆1,120 · Nov 28, 2022 · Updated 3 years ago
- Qualitative evaluation of automatic chord extraction results: analysis of the musical relationships between predicted chords and target c… ☆10 · Oct 25, 2021 · Updated 4 years ago
- A Gym environment for research that applies reinforcement learning algorithms to RNA structure prediction ☆12 · Aug 1, 2019 · Updated 6 years ago
- LibCP -- A Library for Conformal Prediction ☆13 · Feb 26, 2015 · Updated 11 years ago
- Thin wrapper for AllenNLP's implementation of supervised open information extraction ☆17 · Nov 19, 2019 · Updated 6 years ago
- QWOP AI using Q-learning ☆12 · Jul 13, 2016 · Updated 9 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs ☆13 · May 1, 2025 · Updated last year
- Code for ICML2020 "Sequence Generation with Mixed Representations" ☆12 · Jun 27, 2020 · Updated 5 years ago
- Unbounded cache model for online language modeling with open vocabulary ☆11 · Feb 15, 2019 · Updated 7 years ago
- Crowdsourced data for open domain relation classification from sentences ☆20 · Oct 26, 2018 · Updated 7 years ago
- 2019 Language and Intelligence Challenge: knowledge-graph-based proactive conversation ☆115 · May 24, 2019 · Updated 7 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation" ☆14 · Mar 1, 2023 · Updated 3 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task ☆40 · Jun 7, 2020 · Updated 5 years ago
- This repository contains the metadata and data of different databases that we use for testing ☆14 · Jan 29, 2025 · Updated last year
- Open-Source Neural Machine Translation in PyTorch http://opennmt.net/ ☆12 · Apr 30, 2019 · Updated 7 years ago
- vLLM client with minimal dependencies ☆15 · Feb 28, 2024 · Updated 2 years ago
- DSTC6: End-to-End Conversation Modeling Track ☆57 · Jan 19, 2018 · Updated 8 years ago
- Test whether the "focus" mechanism is a valuable addition to attention ☆12 · Dec 8, 2022 · Updated 3 years ago
- Thanks, everyone, for the pull requests ☆17 · Oct 21, 2015 · Updated 10 years ago
- A simple tool to translate a Caffe model to a Keras model ☆10 · Oct 26, 2015 · Updated 10 years ago
- Reversal Curse Experiment ☆15 · Sep 24, 2023 · Updated 2 years ago
- Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning. ☆16 · Nov 7, 2022 · Updated 3 years ago