Toolkit for attaching, training, saving and loading of new heads for transformer models
☆299 · Feb 12, 2026 · Updated 4 months ago
Alternatives and similar repositories for transformer-heads
Users who are interested in transformer-heads are comparing it to the libraries listed below.
- Set of scripts to finetune LLMs ☆38 · Mar 30, 2024 · Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆53 · Jul 10, 2024 · Updated last year
- Python interface for evaluation on ChatGPT models ☆19 · Feb 13, 2024 · Updated 2 years ago
- Tools for merging pretrained large language models. ☆7,190 · Jun 17, 2026 · Updated 2 weeks ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆62 · Apr 8, 2024 · Updated 2 years ago
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,571 · Mar 5, 2026 · Updated 3 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆200 · Feb 13, 2025 · Updated last year
- Simple Model Similarities Analysis ☆21 · Feb 3, 2024 · Updated 2 years ago
- Genetics for Language Models ☆18 · Jul 1, 2024 · Updated 2 years ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 · Jun 7, 2024 · Updated 2 years ago
- ☆56 · Nov 6, 2024 · Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆14 · Mar 30, 2024 · Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,800 · May 28, 2026 · Updated last month
- A compact LLM pretrained in 9 days by using high quality data ☆342 · Apr 9, 2025 · Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆288 · Jul 11, 2024 · Updated last year
- FuseAI Project ☆601 · Jan 25, 2025 · Updated last year
- Training LLMs with QLoRA + FSDP ☆1,549 · Nov 9, 2024 · Updated last year
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆683 · Aug 22, 2024 · Updated last year
- Efficient few-shot learning with Sentence Transformers ☆2,761 · May 26, 2026 · Updated last month
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- Understanding the correlation between different LLM benchmarks ☆30 · Jan 11, 2024 · Updated 2 years ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM. ☆517 · Aug 26, 2024 · Updated last year
- TempoPFN: Zero-shot Time Series Forecasting (accepted at EurIPS 2025 AI for Tabular Data Workshop) ☆41 · Nov 10, 2025 · Updated 7 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆16 · Aug 23, 2023 · Updated 2 years ago
- LLM model runway server ☆13 · Sep 13, 2023 · Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting ☆16 · Jan 17, 2024 · Updated 2 years ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs ☆19 · Aug 17, 2023 · Updated 2 years ago
- Go ahead and axolotl questions ☆12,121 · Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,449 · Jun 26, 2026 · Updated last week
- Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR. ☆25 · Mar 27, 2025 · Updated last year
- Hill Space is All You Need ☆17 · Jul 11, 2025 · Updated 11 months ago
- Automatically evaluate your LLMs in Google Colab ☆687 · May 7, 2024 · Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆70 · Nov 17, 2025 · Updated 7 months ago
- Tree-based indexes for neural-search ☆33 · Mar 4, 2024 · Updated 2 years ago
- Customizable implementation of the self-instruct paper. ☆1,052 · Mar 7, 2024 · Updated 2 years ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆46 · Dec 19, 2023 · Updated 2 years ago
- Robust recipes to align language models with human and AI preferences ☆5,623 · May 26, 2026 · Updated last month
- A new benchmark for measuring LLM's capability to detect bugs in large codebase. ☆33 · Jun 5, 2024 · Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,138 · May 26, 2026 · Updated last month