pytorch-tpu / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆15 Updated this week
Alternatives and similar repositories for transformers:
Users interested in transformers are comparing it to the libraries listed below.
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, though it should work with any Hugging Face text dataset.☆93 Updated 2 years ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆14 Updated last year
- ☆73 Updated last year
- Calculating the expected time for training an LLM.☆38 Updated last year
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43 Updated 2 years ago
- PyTorch implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63 Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58 Updated 2 years ago
- ☆55 Updated 2 years ago
- Megatron-LM 11B on Hugging Face Transformers☆27 Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74 Updated 3 years ago
- Tools for managing datasets for governance and training.☆82 Updated 2 weeks ago
- ☆96 Updated 2 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36 Updated 3 years ago
- BLOOM+1: Adapting the BLOOM model to support a new, unseen language☆70 Updated 11 months ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119 Updated last year
- The official repository for the paper "Efficient Long-Text Understanding Using Short-Text Models" (Ivgi et al., 2022)☆68 Updated last year
- DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)☆50 Updated last year
- ☆77 Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆97 Updated last year
- ☆21 Updated 3 years ago
- Dense hybrid representations for text retrieval☆62 Updated last year
- ☆97 Updated 2 years ago
- PyTorch reimplementation of REALM and ORQA☆22 Updated 3 years ago
- Transformers at any scale☆41 Updated last year
- Pre-training BART in Flax on The Pile dataset☆20 Updated 3 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆15 Updated last year
- ☆67 Updated 2 years ago
- Hugging Face RoBERTa with Flash Attention 2☆21 Updated last year
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆48 Updated last year
- Train 🤗Transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23 Updated 3 years ago