hiyouga / transformersLinks

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

☆10

Alternatives and similar repositories for transformers

Users that are interested in transformers are comparing it to the libraries listed below

Sorting:

ulab-uiuc / ToMAP
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆14Updated last month
padas-lab-de / ir-rag-sigir24-persona-rag
☆47Updated 10 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
choosewhatulike / case2code
☆15Updated 3 months ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39Updated 8 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆19Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆35Updated last year
cohere-ai / human-feedback-paper
Code and data from the paper 'Human Feedback is not Gold Standard'
☆19Updated last year
The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆14Updated this week
anyscale / long-context-fine-tuning-blogpost
☆17Updated last year
microsoft / Structured-Entity-Extraction
Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"
☆42Updated 3 months ago
SciPhi-AI / RAG-Performance
Measuring RAG solutions throughput and latency
☆18Updated last year
Hritikbansal / sparse_feedback
☆29Updated last year
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆26Updated 7 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
ziegler-ingo / CRAFT
Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…
☆30Updated 10 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
nuochenpku / COMEDY
This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…
☆19Updated 8 months ago
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆16Updated last year
ShiZhengyan / PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…
☆74Updated last year
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆55Updated 3 weeks ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
likenneth / persona_drift
Measuring and Controlling Persona Drift in Language Model Dialogs
☆17Updated last year
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆65Updated 3 months ago
csinva / tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
☆38Updated last year
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆35Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 10 months ago
RUCAIBox / ChainLM
☆31Updated last year