Toolkit for attaching, training, saving and loading of new heads for transformer models
☆294Feb 12, 2026Updated last month
Alternatives and similar repositories for transformer-heads
Users that are interested in transformer-heads are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆52Jul 10, 2024Updated last year
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- Tools for merging pretrained large language models.☆6,867Mar 15, 2026Updated last week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,564Mar 5, 2026Updated 2 weeks ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated last year
- ☆56Nov 6, 2024Updated last year
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,739May 21, 2025Updated 10 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆283Jul 11, 2024Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆338Apr 9, 2025Updated 11 months ago
- FuseAI Project☆592Jan 25, 2025Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,699Dec 11, 2025Updated 3 months ago
- Training LLMs with QLoRA + FSDP☆1,540Nov 9, 2024Updated last year
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆684Aug 22, 2024Updated last year
- TempoPFN: Zero-shot Time Series Forecasting (accepted at EurIPS 2025 AI for Tabular Data Workshop)☆37Nov 10, 2025Updated 4 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆507Aug 26, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- LLM model runway server☆13Sep 13, 2023Updated 2 years ago
- Go ahead and axolotl questions☆11,460Updated this week
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated last year
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,228Mar 6, 2026Updated 2 weeks ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 8 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.☆25Mar 27, 2025Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 4 months ago
- Tree-based indexes for neural-search☆31Mar 4, 2024Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆46Dec 19, 2023Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,956Mar 16, 2026Updated last week
- Robust recipes to align language models with human and AI preferences☆5,527Sep 8, 2025Updated 6 months ago