基于Transformer的单模型、多尺度的VAE模型
☆57Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for univae
Users that are interested in univae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 无监督文本生成的一些方法☆49Jun 3, 2021Updated 4 years ago
- ☆98Jun 6, 2022Updated 3 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"☆14Jul 24, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering☆33Jun 12, 2023Updated 2 years ago
- DisCo Transformer for Non-autoregressive MT☆77Jul 28, 2022Updated 3 years ago
- 逻辑回归和单层softmax的解析解☆12Jul 29, 2021Updated 4 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders"☆40May 26, 2022Updated 3 years ago
- R-Drop方法在中文任务上的简单实验☆91Mar 2, 2022Updated 4 years ago
- This is CNN forward implementation.☆15May 24, 2021Updated 4 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DataSets links for recommender systems research, in particular for transfer learning, user representation, pre-training,lifelong learning…☆17Feb 26, 2024Updated 2 years ago
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆19Aug 30, 2022Updated 3 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- WIDER FACE annotations converted to the Pascal VOC XML format☆16Jun 14, 2019Updated 6 years ago
- bert4keras实现gpt下中国象棋☆47Nov 12, 2020Updated 5 years ago
- Dependency Grammar Induction☆18Feb 11, 2019Updated 7 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆123Mar 5, 2023Updated 3 years ago
- The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"☆33Nov 25, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆58Sep 23, 2022Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Jul 15, 2022Updated 3 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆16Sep 18, 2025Updated 7 months ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Feb 24, 2023Updated 3 years ago
- Official code of our work, Syntax-augmented Multilingual BERT for Cross-lingual Transfer [ACL 2021].☆16Dec 2, 2021Updated 4 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆17Apr 25, 2021Updated 5 years ago
- Official code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text…☆35Aug 20, 2022Updated 3 years ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- M2Det++, faster and more accurate.☆17May 27, 2019Updated 6 years ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆51Dec 23, 2022Updated 3 years ago
- ☆50Jul 15, 2021Updated 4 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 4 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago