center-for-humans-and-machines / transformer-headsView external linksLinks
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆294Updated this week
Alternatives and similar repositories for transformer-heads
Users that are interested in transformer-heads are comparing it to the libraries listed below
Sorting:
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Jul 10, 2024Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- Tools for merging pretrained large language models.☆6,783Jan 26, 2026Updated 3 weeks ago
- FuseAI Project☆588Jan 25, 2025Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,555Jan 14, 2026Updated last month
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,719May 21, 2025Updated 8 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆507Aug 26, 2024Updated last year
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers☆2,680Dec 11, 2025Updated 2 months ago
- A compact LLM pretrained in 9 days by using high quality data☆339Apr 9, 2025Updated 10 months ago
- LLM model runway server☆13Sep 13, 2023Updated 2 years ago
- Automatically evaluate your LLMs in Google Colab☆685May 7, 2024Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆280Jul 11, 2024Updated last year
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆685Aug 22, 2024Updated last year
- Training LLMs with QLoRA + FSDP☆1,537Nov 9, 2024Updated last year
- Go ahead and axolotl questions☆11,289Updated this week
- ☆56Nov 6, 2024Updated last year
- Customizable implementation of the self-instruct paper.☆1,050Mar 7, 2024Updated last year
- The official repository for the Anything But Wrappers: Llama Edition Hackameetup☆22Sep 1, 2023Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- ☆43Jan 24, 2024Updated 2 years ago
- Official PyTorch implementation of QA-LoRA☆145Mar 13, 2024Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,885Feb 11, 2026Updated last week
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,155Feb 8, 2026Updated last week
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,495Sep 8, 2025Updated 5 months ago
- ☆45Oct 13, 2023Updated 2 years ago
- Synthetic data generator for image, video and 3D models☆32Aug 5, 2024Updated last year
- ☆52Oct 17, 2023Updated 2 years ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago