A single-model, multi-scale VAE based on Transformer
☆58Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for univae
Users that are interested in univae are comparing it to the libraries listed below
- Some methods for unsupervised text generation☆49Jun 3, 2021Updated 4 years ago
- ☆98Jun 6, 2022Updated 3 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- ☆10Mar 28, 2022Updated 3 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"☆14Jul 24, 2020Updated 5 years ago
- A GPT that plays Chinese chess (xiangqi), implemented with bert4keras☆46Nov 12, 2020Updated 5 years ago
- DisCo Transformer for Non-autoregressive MT☆77Jul 28, 2022Updated 3 years ago
- Dependency Grammar Induction☆18Feb 11, 2019Updated 7 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆123Mar 5, 2023Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Jul 15, 2022Updated 3 years ago
- Analytical solutions for logistic regression and single-layer softmax☆12Jul 29, 2021Updated 4 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆31May 1, 2023Updated 2 years ago
- The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"☆33Nov 25, 2020Updated 5 years ago
- huggingface ChineseBert Tokenizer☆16Apr 16, 2022Updated 3 years ago
- DataSets links for recommender systems research, in particular for transfer learning, user representation, pre-training, lifelong learning…☆17Feb 26, 2024Updated 2 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Transformer model based on the Gated Attention Unit (preview version)☆97Feb 24, 2023Updated 3 years ago
- Simple experiments with the R-Drop method on Chinese tasks☆91Mar 2, 2022Updated 4 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- ☆23Jun 30, 2025Updated 8 months ago
- Dynamic Entity Summarization (DynES)☆20May 10, 2019Updated 6 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Apr 25, 2021Updated 4 years ago
- torch TH/THC c++11 wrapper☆14Jun 14, 2017Updated 8 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
- A python open-source distributed in-memory cache and database.☆21Jul 30, 2020Updated 5 years ago
- GAU-alpha-pytorch☆20May 11, 2022Updated 3 years ago
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240☆168Oct 7, 2022Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆116Oct 27, 2022Updated 3 years ago
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆19Aug 30, 2022Updated 3 years ago
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 3 years ago
- NLP experiment: new word discovery + continued pre-training of a pretrained model☆47Sep 15, 2023Updated 2 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Dec 14, 2022Updated 3 years ago