joeljang / Pretraining_T5_custom_dataset
Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints
☆39Updated 3 years ago
Alternatives and similar repositories for Pretraining_T5_custom_dataset:
Users that are interested in Pretraining_T5_custom_dataset are comparing it to the libraries listed below
- First explanation metric (diagnostic report) for text generation evaluation☆63Updated 6 months ago
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆99Updated 2 years ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆99Updated last year
- Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆31Updated 3 years ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- ☆26Updated 2 years ago
- ☆31Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆42Updated last month
- ☆28Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 7 months ago
- Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge☆61Updated last year
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).☆27Updated last year
- ☆15Updated 2 years ago
- DSTC 11 Track 2: Intent Induction from Conversations for Task-Oriented Dialogue☆45Updated last year
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated last year
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆34Updated last year
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated last year
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆10Updated 2 months ago
- ☆10Updated 4 months ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 2 years ago
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆61Updated last year
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆24Updated 2 years ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆65Updated 2 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization☆55Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆69Updated 8 months ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆79Updated last year
- Code and Data Repo for [ACL 2023] Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆23Updated 10 months ago