nawnoes / pytorch-gpt-x

Implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.

☆32

Alternatives and similar repositories for pytorch-gpt-x

Users who are interested in pytorch-gpt-x are comparing it to the libraries listed below.

facebookresearch / ketod
KETOD Knowledge-Enriched Task-Oriented Dialogue
☆32 Updated 2 years ago
naver-ai / neuralwoz
NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)
☆36 Updated 3 years ago
jason9693 / ETA4LLMs
Calculating the expected time for training an LLM.
☆38 Updated 2 years ago
nlpods / LayerAttPooler
Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling
☆9 Updated 2 years ago
gentaiscool / few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
☆53 Updated 3 years ago
amazon-science / transformers-data-augmentation
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆52 Updated 2 years ago
monologg / EncT5
PyTorch implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
☆63 Updated 3 years ago
tlkh / t2t-tuner
Convenient Text-to-Text Training for Transformers
☆19 Updated 3 years ago
Beomi / transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23 Updated 4 years ago
robinsongh381 / UNILM_Pytorch_Korean
☆11 Updated 5 years ago
sooftware / luna-transformer
A PyTorch implementation of Luna: Linear Unified Nested Attention
☆41 Updated 3 years ago
facebookresearch / ELECTRA-Fewshot-Learning
This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models".
☆48 Updated 3 years ago
jeongukjae / tta
Transformer-based Text Auto-encoder (T-TA) using TensorFlow 2.
☆13 Updated 4 years ago
hyunwoongko / bert2bert-summarization
Abstractive summarization using the Bert2Bert framework.
☆31 Updated 4 years ago
ModuNLP / hacking_transformers
☆11 Updated 4 years ago
codertimo / python-template
Python project template for personal projects! 🙋‍♀️
☆10 Updated 4 years ago
MichaelZhouwang / Sequence_Span_Rewriting
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17 Updated 3 years ago
facebookresearch / romqa
A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
☆16 Updated 2 years ago
devjwsong / t5-dst-modified-pytorch
Modified version of T5-DST for Dialogue State Tracking.
☆18 Updated 3 years ago
seujung / t5-summarization
☆26 Updated 4 years ago
Beomi / exbert-transformers
exBERT on Transformers🤗
☆10 Updated 4 years ago
naver-ai / carecall-memory
Keep Me Updated! Memory Management in Long-term Conversations (Findings of EMNLP 2022)
☆31 Updated 2 years ago
cosmoquester / transformers-bart-pretrain
Script to pre-train Hugging Face transformers BART with TensorFlow 2
☆33 Updated 2 years ago
sooftware / nlp-tasks
Natural Language Processing Tasks and Examples.
☆62 Updated 2 years ago
convei-lab / BotsTalk
🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…
☆16 Updated 9 months ago
microsoft / Efficient-Large-LM-Trainer
☆38 Updated 11 months ago
hunkim / ACL-2020-Papers
Statistics and accepted paper list of ACL 2020 with arXiv links
☆23 Updated 5 years ago
naver-ai / MetricMT
The official code repository for MetricMT, a reward optimization method for NMT with learned metrics
☆25 Updated 4 years ago
clovaai / pkm-transformers
Official implementation of PKM-augmented language models (Findings of EMNLP 2020)
☆10 Updated 4 years ago
hyunwoongko / summarizers
Package for controllable summarization
☆78 Updated 2 years ago