joeljang / Pretraining_T5_custom_datasetLinks

Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints

☆38

Alternatives and similar repositories for Pretraining_T5_custom_dataset

Users that are interested in Pretraining_T5_custom_dataset are comparing it to the libraries listed below

Sorting:

RUCAIBox / Transfer-Prompts-for-Text-Generation
☆27Updated 2 years ago
DevSinghSachan / art
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
☆62Updated 2 years ago
facebookresearch / bart_ls
Long-context pretrained encoder-decoder models
☆96Updated 2 years ago
Yushi-Hu / IC-DST
Code base of In-Context Learning for Dialogue State tracking
☆45Updated last year
prakharguptaz / Instructdial
Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
☆100Updated 2 years ago
Silin159 / PeaCoK
☆33Updated 4 months ago
jordiclive / ControlPrefixes
☆90Updated last year
salesforce / DialFact
We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…
☆42Updated 2 years ago
Yale-LILY / FeTaQA
Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"
☆82Updated 2 years ago
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆96Updated 2 years ago
DevSinghSachan / unsupervised-passage-reranking
Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"
☆101Updated 2 years ago
eladsegal / strategyqa
The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".
☆76Updated 2 years ago
wzhouad / NLL-IE
Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021
☆55Updated 3 years ago
amy-hyunji / Generative-Multihop-Retrieval
☆31Updated 2 years ago
Shark-NLP / CoNT
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆154Updated 2 years ago
xu1998hz / InstructScore_SEScore3
First explanation metric (diagnostic report) for text generation evaluation
☆62Updated 5 months ago
thu-coai / CPT4DST
Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).
☆27Updated 2 years ago
Yale-LILY / DYLE
Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization
☆55Updated 2 years ago
littlehacker26 / Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…
☆26Updated last year
awslabs / durepa-hybrid-qa
☆13Updated last year
Alsace08 / SumCoT
[ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"
☆54Updated last year
alisawuffles / DExperts
code associated with ACL 2021 DExperts paper
☆115Updated 2 years ago
nicola-decao / KnowledgeEditor
Code for Editing Factual Knowledge in Language Models
☆139Updated 3 years ago
soheeyang / unified-prompt-selection
[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
☆11Updated 8 months ago
yilunzhao / Awsome-Table-Reasoning
A comprehensive paper list of Reasoning over Tables.
☆28Updated 2 years ago
luka-group / Lattice
[NAACL 2022] Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning.
☆57Updated last year
awslabs / pptod
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)
☆159Updated last year
Yale-LILY / ROSE
☆39Updated 2 years ago
IBM / multidoc2dial
MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents
☆69Updated 3 years ago
amazon-science / summary-reference-revision
☆19Updated last year