A Transformer-based, single-model, multi-scale VAE
☆58Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for univae
Users that are interested in univae are comparing it to the libraries listed below.
- Some methods for unsupervised text generation☆49Jun 3, 2021Updated 4 years ago
- ☆98Jun 6, 2022Updated 3 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering☆33Jun 12, 2023Updated 2 years ago
- Analytical solutions for logistic regression and single-layer softmax☆12Jul 29, 2021Updated 4 years ago
- Triton version of GQA flash attention, based on the tutorial☆12Aug 4, 2024Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders"☆40May 26, 2022Updated 3 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆31May 1, 2023Updated 2 years ago
- Simple experiments with the R-Drop method on Chinese tasks☆91Mar 2, 2022Updated 4 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- DataSets links for recommender systems research, in particular for transfer learning, user representation, pre-training,lifelong learning…☆17Feb 26, 2024Updated 2 years ago
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆19Aug 30, 2022Updated 3 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- Dependency Grammar Induction☆18Feb 11, 2019Updated 7 years ago
- ☆22Jul 10, 2023Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆123Mar 5, 2023Updated 3 years ago
- The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"☆33Nov 25, 2020Updated 5 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Jul 15, 2022Updated 3 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆16Sep 18, 2025Updated 6 months ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Smith-Heilmann Question Extraction (fork)☆17Jan 17, 2014Updated 12 years ago
- Official code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text…☆35Aug 20, 2022Updated 3 years ago
- WWW'2019: Modeling Item-Specific Temporal Dynamics of Repeat Consumption for Recommender Systems☆35Oct 18, 2019Updated 6 years ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- Pytorch implementation of Performer from the paper "Rethinking Attention with Performers".☆25Oct 5, 2020Updated 5 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆51Dec 23, 2022Updated 3 years ago
- ☆50Jul 15, 2021Updated 4 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 3 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆22Jan 25, 2023Updated 3 years ago
- Open-Source Neural Machine Translation in PyTorch http://opennmt.net/☆17Jan 7, 2021Updated 5 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Nov 21, 2021Updated 4 years ago
- Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.☆48Jul 22, 2023Updated 2 years ago
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 3 years ago
- torch TH/THC c++11 wrapper☆14Jun 14, 2017Updated 8 years ago
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 5 years ago