基于Transformer的单模型、多尺度的VAE模型
☆57Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for univae
Users that are interested in univae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 无监督文本生成的一些方法☆49Jun 3, 2021Updated 4 years ago
- ☆98Jun 6, 2022Updated 3 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"☆14Jul 24, 2020Updated 5 years ago
- Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering☆33Jun 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DisCo Transformer for Non-autoregressive MT☆77Jul 28, 2022Updated 3 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders"☆40May 26, 2022Updated 4 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆31May 1, 2023Updated 3 years ago
- R-Drop方法在中文任务上的简单实验☆91Mar 2, 2022Updated 4 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- DataSets links for recommender systems research, in particular for transfer learning, user representation, pre-training,lifelong learning…☆17Feb 26, 2024Updated 2 years ago
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆19Aug 30, 2022Updated 3 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- bert4keras实现gpt下中国象棋☆47Nov 12, 2020Updated 5 years ago
- Dependency Grammar Induction☆18Feb 11, 2019Updated 7 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆122Mar 5, 2023Updated 3 years ago
- The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"☆33Nov 25, 2020Updated 5 years ago
- ☆58Sep 23, 2022Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Jul 15, 2022Updated 3 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Feb 24, 2023Updated 3 years ago
- Official code of our work, Syntax-augmented Multilingual BERT for Cross-lingual Transfer [ACL 2021].☆16Dec 2, 2021Updated 4 years ago
- Smith-Heilmann Question Extraction (fork)☆17Jan 17, 2014Updated 12 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆17Apr 25, 2021Updated 5 years ago
- WWW'2019: Modeling Item-Specific Temporal Dynamics of Repeat Consumption for Recommender Systems☆35Oct 18, 2019Updated 6 years ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆51Dec 23, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆50Jul 15, 2021Updated 4 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 4 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆22Jan 25, 2023Updated 3 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆181Nov 21, 2021Updated 4 years ago
- Open-Source Neural Machine Translation in PyTorch http://opennmt.net/☆17Jan 7, 2021Updated 5 years ago