cosmoquester / transformers-bart-pretrainLinks

Script to pre-train hugginface transformers BART with Tensorflow 2

☆33

Alternatives and similar repositories for transformers-bart-pretrain

Users that are interested in transformers-bart-pretrain are comparing it to the libraries listed below

Sorting:

monologg / EncT5
Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
☆63Updated 3 years ago
pkchat-focus / FoCus
Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge
☆62Updated 2 years ago
hyunwoongko / bert2bert-summarization
Abstractive summarization using Bert2Bert framework.
☆31Updated 4 years ago
jeewoo1025 / BalSum
ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization
☆16Updated last year
morganmcg1 / rotobart
Pre-training BART in Flax on The Pile dataset
☆21Updated 4 years ago
facebookresearch / bart_ls
Long-context pretrained encoder-decoder models
☆96Updated 2 years ago
hwanheelee1993 / MFMA
Factual consistency checking model for abstractive summaries (NAACL-22 Findings)
☆30Updated 3 years ago
donggyukimc / Inverse-cloze-task
Test code of Inverse cloze task for information retrieval
☆33Updated 4 years ago
Huffon / factsumm
FactSumm: Factual Consistency Scorer for Abstractive Summarization
☆111Updated last year
robinsongh381 / UNILM_Pytorch_Korean
☆11Updated 5 years ago
Jimin9401 / avocado
AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain
☆23Updated 3 years ago
laihuiyuan / pre-trained-formality-transfer
Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)
☆30Updated 2 years ago
amazon-science / dstc11-track2-intent-induction
DSTC 11 Track 2: Intent Induction from Conversations for Task-Oriented Dialogue
☆48Updated 2 years ago
hyunwoongko / megatron-11b
Megatron LM 11B on Huggingface Transformers
☆27Updated 4 years ago
naver-ai / hypermix
Code for text augmentation method leveraging large-scale language models
☆62Updated 3 years ago
zengyan-97 / Transformer-DST
A Generative Dialogue State Tracking Model
☆22Updated 4 years ago
andrejmiscic / simcls-pytorch
PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"
☆16Updated 3 years ago
jshin49 / ds2
Code for DS2 paper
☆20Updated 3 years ago
Beomi / transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23Updated 4 years ago
naver-ai / carecall-corpus
CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).
☆60Updated 3 years ago
bepoetree / MTTOD
☆26Updated 3 years ago
AIRC-KETI / kowow
This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…
☆16Updated 2 years ago
salesforce / DialFact
We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…
☆42Updated 2 years ago
facebookresearch / ELECTRA-Fewshot-Learning
This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.
☆48Updated 3 years ago
monologg / py-backtrans
Python library for backtranslation (with Google Translate)
☆12Updated 5 years ago
nlpods / LayerAttPooler
Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling
☆9Updated 2 years ago
DevSinghSachan / art
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
☆62Updated 2 years ago
e0397123 / dstc10_metric_track
The Official Repository for the Automatic Dialogue Evaluation Sub-task of DSTC10 Track 5 (Automatic Evaluation and Moderation of Open-dom…
☆19Updated 3 years ago
naver-ai / KoBBQ
Official code and dataset repository of KoBBQ (TACL 2024)
☆18Updated last year
clovaai / minimal-rnr-qa
[NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering
☆36Updated 4 years ago