MAsked Sequence to Sequence (MASS) pre-training for language generation
☆20Mar 18, 2019Updated 7 years ago
Alternatives and similar repositories for MASS
Users who are interested in MASS are comparing it to the libraries listed below.
- PyTorch attentional NMT (with NLL, MRT, REINFORCE, MIXER training objectives)☆13May 12, 2017Updated 8 years ago
- Pytorch implementation of QAnet☆13May 6, 2018Updated 7 years ago
- Gated CNN☆10Jul 17, 2019Updated 6 years ago
- Analyzing Uncertainty in Neural Machine Translation☆35Sep 15, 2021Updated 4 years ago
- Code for AAAI-2020 oral paper: RefNet: A Reference-aware Network for Background Based Conversation☆25Nov 24, 2019Updated 6 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆19May 6, 2021Updated 4 years ago
- Deep learning study in Gluon 2nd edition☆24Mar 6, 2019Updated 7 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- English-French MT dialogue dataset☆17Apr 29, 2022Updated 3 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- seq2seq with attention in mxnet☆18Oct 13, 2017Updated 8 years ago
- A Pointer Generator with a BERT encoder☆10Aug 12, 2019Updated 6 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,121Nov 28, 2022Updated 3 years ago
- Qualitative evaluation of automatic chord extraction results: analysis of the musical relationships between predicted chords and target c…☆10Oct 25, 2021Updated 4 years ago
- A gym environment for the research which apply the reinforcement learning algorithm to the RNA structure prediction☆12Aug 1, 2019Updated 6 years ago
- QWOP AI using Q-learning☆12Jul 13, 2016Updated 9 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated 11 months ago
- Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)☆51Nov 8, 2023Updated 2 years ago
- MXNet implementation of WaveNet☆19Oct 20, 2016Updated 9 years ago
- Sequential Matching Network implemented by MXNET☆18Mar 11, 2019Updated 7 years ago
- Crowdsourced data for open domain relation classification from sentences☆20Oct 26, 2018Updated 7 years ago
- 2019 Language and Intelligence Technology Competition: knowledge-graph-based proactive chat☆115May 24, 2019Updated 6 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- ISMIR 2021: Curriculum Learning for Imbalanced Classification in Large Vocabulary Automatic Chord Recognition☆10Nov 8, 2021Updated 4 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- Working towards a python implementation of image completion.☆12May 14, 2014Updated 11 years ago
- Open-Source Neural Machine Translation in PyTorch http://opennmt.net/☆12Apr 30, 2019Updated 6 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- vLLM client with minimal dependencies☆15Feb 28, 2024Updated 2 years ago
- DSTC6: End-to-End Conversation Modeling Track☆57Jan 19, 2018Updated 8 years ago
- Test whether the "focus" mechanism is a valuable addition to attention☆12Dec 8, 2022Updated 3 years ago
- a simple tool to translate caffe model to keras model☆10Oct 26, 2015Updated 10 years ago
- Mxnet implementation of an ICLR 2018 paper: A new method of region embedding for text classification.☆10Oct 14, 2018Updated 7 years ago
- ☆20Feb 26, 2021Updated 5 years ago
- Reversal Curse Experiment☆15Sep 24, 2023Updated 2 years ago
- Shandong University Qingdao Campus map app☆10Nov 7, 2020Updated 5 years ago
- Question-Answering ranking with Deep Learning models (cDSSM, Convolutional, LSTM, Word2Vec). Applied to InsuranceQA dataset and customer …☆16Oct 17, 2017Updated 8 years ago
- MxNet Gluon Implementation of Center Loss: A Discriminative Feature Learning Approach for Deep Face Recognition☆34Nov 8, 2017Updated 8 years ago