基于Transformer的单模型、多尺度的VAE模型
☆57Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for univae
Users that are interested in univae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 无监督文本生成的一些方法☆49Jun 3, 2021Updated 4 years ago
- ☆98Jun 6, 2022Updated 3 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"☆14Jul 24, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DisCo Transformer for Non-autoregressive MT☆77Jul 28, 2022Updated 3 years ago
- 逻辑回归和单层softmax的解析解☆12Jul 29, 2021Updated 4 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders"☆40May 26, 2022Updated 3 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆31May 1, 2023Updated 2 years ago
- R-Drop方法在中文任务上的简单实验☆91Mar 2, 2022Updated 4 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- DataSets links for recommender systems research, in particular for transfer learning, user representation, pre-training,lifelong learning…☆17Feb 26, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆19Aug 30, 2022Updated 3 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
- bert4keras实现gpt下中国象棋☆47Nov 12, 2020Updated 5 years ago
- Dependency Grammar Induction☆18Feb 11, 2019Updated 7 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆123Mar 5, 2023Updated 3 years ago
- The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"☆33Nov 25, 2020Updated 5 years ago
- ☆59Sep 23, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Jul 15, 2022Updated 3 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆16Sep 18, 2025Updated 7 months ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Feb 24, 2023Updated 3 years ago
- Official code of our work, Syntax-augmented Multilingual BERT for Cross-lingual Transfer [ACL 2021].☆16Dec 2, 2021Updated 4 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆17Apr 25, 2021Updated 4 years ago
- Official code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text…☆35Aug 20, 2022Updated 3 years ago
- WWW'2019: Modeling Item-Specific Temporal Dynamics of Repeat Consumption for Recommender Systems☆35Oct 18, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- ☆50Jul 15, 2021Updated 4 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Nov 21, 2021Updated 4 years ago
- Open-Source Neural Machine Translation in PyTorch http://opennmt.net/☆17Jan 7, 2021Updated 5 years ago