MAsked Sequence to Sequence (MASS) pre-training for language generation
☆20Mar 18, 2019Updated 7 years ago
Alternatives and similar repositories for MASS
Users who are interested in MASS are comparing it to the libraries listed below.
- PyTorch attentional NMT (with NLL, MRT, REINFORCE, and MIXER training objectives)☆13May 12, 2017Updated 8 years ago
- Conference notes for AAAI 2019☆15Feb 1, 2019Updated 7 years ago
- Gated CNN☆10Jul 17, 2019Updated 6 years ago
- Analyzing Uncertainty in Neural Machine Translation☆35Sep 15, 2021Updated 4 years ago
- Soft Contextual Data Augmentation☆39Jul 25, 2024Updated last year
- Code for "Kernelized Bayesian Softmax for Text Generation" (NeurIPS 2019)☆16Nov 20, 2019Updated 6 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆19May 6, 2021Updated 5 years ago
- A third-party implementation of paper《SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spell…☆14Nov 27, 2020Updated 5 years ago
- ☆121Dec 8, 2022Updated 3 years ago
- Deep learning study in Gluon 2nd edition☆24Mar 6, 2019Updated 7 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- Triton version of GQA flash attention, based on the tutorial☆12Aug 4, 2024Updated last year
- English-French MT dialogue dataset☆17Apr 29, 2022Updated 4 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- A Pointer Generator with a BERT encoder☆10Aug 12, 2019Updated 6 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,121Nov 28, 2022Updated 3 years ago
- A gym environment for research that applies reinforcement learning algorithms to RNA structure prediction☆12Aug 1, 2019Updated 6 years ago
- LibCP -- A Library for Conformal Prediction☆13Feb 26, 2015Updated 11 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 7 years ago
- Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)☆51Nov 8, 2023Updated 2 years ago
- Sequential Matching Network implemented by MXNET☆18Mar 11, 2019Updated 7 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- A difficulty-aware embedding of complementary deep networks for image classification☆13Jul 25, 2024Updated last year
- Open-Source Neural Machine Translation in PyTorch http://opennmt.net/☆12Apr 30, 2019Updated 7 years ago
- ☆10Jan 25, 2019Updated 7 years ago
- Test whether the "focus" mechanism is a valuable addition to attention☆12Dec 8, 2022Updated 3 years ago
- Mxnet implementation of an ICLR 2018 paper: A new method of region embedding for text classification.☆10Oct 14, 2018Updated 7 years ago
- Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning.☆16Nov 7, 2022Updated 3 years ago
- Map app for the Shandong University Qingdao campus☆10Nov 7, 2020Updated 5 years ago
- Question-Answering ranking with Deep Learning models (cDSSM, Convolutional, LSTM, Word2Vec). Applied to InsuranceQA dataset and customer …☆16Oct 17, 2017Updated 8 years ago
- MxNet Gluon Implementation of Center Loss: A Discriminative Feature Learning Approach for Deep Face Recognition☆34Nov 8, 2017Updated 8 years ago
- A curated collection of open Chinese chat corpora☆13Nov 5, 2018Updated 7 years ago
- Implementation of DTMT with incremental decoding☆13Feb 20, 2021Updated 5 years ago
- ☆20Apr 16, 2025Updated last year
- Load Tensorflow pb file using Bert/TextCNNs, an ensemble model using Java.☆10Aug 20, 2021Updated 4 years ago
- Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning☆14Apr 11, 2022Updated 4 years ago