google-research / t5xLinks

☆2,858

Alternatives and similar repositories for t5x

Users that are interested in t5x are comparing it to the libraries listed below

Sorting:

EleutherAI / pythia
The hub for EleutherAI's work on interpretability and learning dynamics
☆2,582Updated 2 months ago
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,692Updated last year
google-research / FLAN
☆1,532Updated last month
young-geng / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆2,496Updated 11 months ago
EleutherAI / the-pile
☆1,589Updated 2 years ago
google / BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
☆3,092Updated last year
allenai / RL4LMs
A modular RL library to fine-tune language models to human preferences
☆2,333Updated last year
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,405Updated 3 months ago
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,341Updated last year
microsoft / DeBERTa
The implementation of DeBERTa
☆2,128Updated last year
microsoft / torchscale
Foundation Architecture for (M)LLMs
☆3,097Updated last year
togethercomputer / RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,786Updated 8 months ago
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,352Updated 3 months ago
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,441Updated 2 years ago
bigscience-workshop / promptsource
Toolkit for creating, sharing and using natural language prompts.
☆2,917Updated last year
lucidrains / toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,042Updated last year
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,427Updated last week
microsoft / mup
maximal update parametrization (µP)
☆1,576Updated last year
bigscience-workshop / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆1,406Updated last year
bigscience-workshop / bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆1,006Updated last year
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,770Updated last month
openai / following-instructions-human-feedback
☆1,231Updated 2 years ago
openai / lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
☆1,352Updated 2 years ago
IST-DASLab / gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
☆2,155Updated last year
huggingface / optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…
☆3,005Updated last week
lucidrains / RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆870Updated last year
adapter-hub / adapters
A Unified Library for Parameter-Efficient and Modular Transfer Learning
☆2,748Updated 2 months ago
stanford-crfm / helm
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models …
☆2,389Updated this week
huggingface / evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
☆2,285Updated 3 weeks ago
lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆823Updated 2 years ago