joeljang / Pretraining_T5_custom_dataset
Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints
☆38Updated 4 years ago
Alternatives and similar repositories for Pretraining_T5_custom_dataset
Users who are interested in Pretraining_T5_custom_dataset are comparing it to the libraries listed below
- [EMNLP 2022] Salience Allocation as Guidance for Abstractive Summarization☆25Updated last year
- ☆26Updated 2 years ago
- ☆32Updated last month
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated 2 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆80Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆61Updated 2 months ago
- Collection of scripts to pretrain T5 on unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆32Updated 4 years ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆73Updated 2 years ago
- A comprehensive paper list of Reasoning over Tables.☆28Updated 2 years ago
- Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago
- Long-context pretrained encoder-decoder models☆94Updated 2 years ago
- ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization☆15Updated last year
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- ☆28Updated 2 years ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Updated last year
- ☆38Updated last year
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆120Updated last year
- Script to pre-train Hugging Face Transformers BART with TensorFlow 2☆33Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- Lexically constrained text generation with CBART.☆48Updated 2 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated 2 years ago
- [COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization☆25Updated last year
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization☆17Updated 6 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆101Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆103Updated 2 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆10Updated 6 months ago
- Code for EMNLP 2021 paper "CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization"☆46Updated 3 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152Updated 2 years ago