cosmoquester / transformers-bart-pretrain
Script to pre-train hugginface transformers BART with Tensorflow 2
☆33Updated last year
Alternatives and similar repositories for transformers-bart-pretrain:
Users that are interested in transformers-bart-pretrain are comparing it to the libraries listed below
- Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge☆62Updated last year
- Pre-training BART in Flax on The Pile dataset☆20Updated 3 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- Test code of Inverse cloze task for information retrieval☆33Updated 4 years ago
- ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization☆15Updated last year
- Abstractive summarization using Bert2Bert framework.☆31Updated 4 years ago
- A Generative Dialogue State Tracking Model☆22Updated 3 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 2 years ago
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- ☆11Updated 4 years ago
- ☆25Updated 2 years ago
- DSTC 11 Track 2: Intent Induction from Conversations for Task-Oriented Dialogue☆47Updated last year
- Long-context pretrained encoder-decoder models☆94Updated 2 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Updated 2 years ago
- Code for Handling Divergent Reference Texts when Evaluating Table-to-Text Generation (Dhingra et al. 2019)☆31Updated 3 years ago
- Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering (Kim et al., ACL 2021)☆32Updated 2 years ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- Code for DS2 paper☆20Updated 2 years ago
- ☆53Updated last year
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 2 years ago
- Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling☆9Updated 2 years ago
- ☆26Updated 2 years ago
- Megatron LM 11B on Huggingface Transformers☆27Updated 3 years ago
- Source code for Dialogue State Tracking with a Language Model using Schema-Driven Prompting☆63Updated 4 months ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated last year
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization☆55Updated last year
- The code repository for NAACL 2021 paper "AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization".☆34Updated 3 years ago
- ☆13Updated 3 years ago
- ☆44Updated 3 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago