Toolkit for attaching, training, saving and loading of new heads for transformer models
☆297Feb 12, 2026Updated 2 months ago
Alternatives and similar repositories for transformer-heads
Users that are interested in transformer-heads are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆52Jul 10, 2024Updated last year
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- Tools for merging pretrained large language models.☆7,023Mar 15, 2026Updated last month
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,566Mar 5, 2026Updated last month
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- ☆56Nov 6, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,771May 21, 2025Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆287Jul 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A compact LLM pretrained in 9 days by using high quality data☆340Apr 9, 2025Updated last year
- FuseAI Project☆595Jan 25, 2025Updated last year
- Training LLMs with QLoRA + FSDP☆1,542Nov 9, 2024Updated last year
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆683Aug 22, 2024Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,724Apr 17, 2026Updated 2 weeks ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆511Aug 26, 2024Updated last year
- TempoPFN: Zero-shot Time Series Forecasting (accepted at EurIPS 2025 AI for Tabular Data Workshop)☆38Nov 10, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- LLM model runway server☆13Sep 13, 2023Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- Go ahead and axolotl questions☆11,779Updated this week
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,326Apr 25, 2026Updated last week
- Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.☆25Mar 27, 2025Updated last year
- Hill Space is All You Need☆17Jul 11, 2025Updated 9 months ago
- Automatically evaluate your LLMs in Google Colab☆688May 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 5 months ago
- Tree-based indexes for neural-search☆33Mar 4, 2024Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,053Mar 7, 2024Updated 2 years ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆46Dec 19, 2023Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,587Apr 8, 2026Updated 3 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,015Apr 20, 2026Updated last week
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago