Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
☆270 · Jan 10, 2026 · Updated last month
Alternatives and similar repositories for llm_steer
Users who are interested in llm_steer are comparing it to the libraries listed below.
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 · Mar 21, 2024 · Updated last year
- A library for making RepE control vectors ☆685 · Sep 24, 2025 · Updated 4 months ago
- ☆338 · Jul 28, 2025 · Updated 6 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface ☆140 · Feb 21, 2025 · Updated 11 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 · Apr 29, 2024 · Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats. ☆31 · May 29, 2023 · Updated 2 years ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,407 · Apr 11, 2024 · Updated last year
- Simple LLM inference server ☆20 · Jun 13, 2024 · Updated last year
- [ICLR 2025] General-purpose activation steering library ☆142 · Sep 18, 2025 · Updated 4 months ago
- ☆17 · Jan 30, 2024 · Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆240 · May 26, 2024 · Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆198 · Feb 13, 2025 · Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 · Feb 22, 2024 · Updated last year
- Simple program to manually caption your images (or any other file types) so you can use them for AI training ☆36 · Mar 20, 2023 · Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆233 · Oct 31, 2024 · Updated last year
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers ☆114 · Sep 13, 2024 · Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,234 · May 8, 2024 · Updated last year
- Large-scale LLM inference engine ☆1,651 · Jan 21, 2026 · Updated 3 weeks ago
- Simplex Random Feature attention, in PyTorch ☆75 · Oct 10, 2023 · Updated 2 years ago
- Scripts to create your own moe models using mlx ☆90 · Feb 26, 2024 · Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec… ☆21 · Feb 23, 2024 · Updated last year
- This repository contains a framework for converting monocular videos into side-by-side (SBS) 3D videos. It utilizes a combination of imag… ☆90 · Feb 11, 2024 · Updated 2 years ago
- Customizable implementation of the self-instruct paper. ☆1,050 · Mar 7, 2024 · Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,852 · May 17, 2025 · Updated 8 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- Easily create LLM automation/agent workflows ☆60 · Feb 13, 2024 · Updated 2 years ago
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,555 · Jan 14, 2026 · Updated last month
- Steering Llama 2 with Contrastive Activation Addition ☆209 · May 23, 2024 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Jan 7, 2024 · Updated 2 years ago
- ☆119 · Dec 18, 2024 · Updated last year
- Stanford NLP Python library for understanding and improving PyTorch models via interventions ☆858 · Jan 29, 2026 · Updated 2 weeks ago
- A bagel, with everything. ☆326 · Apr 11, 2024 · Updated last year
- Create Custom LLMs ☆1,806 · Nov 8, 2025 · Updated 3 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork" ☆33 · Nov 29, 2023 · Updated 2 years ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆74 · Apr 19, 2025 · Updated 9 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆3,436 · Nov 13, 2024 · Updated last year
- A resource repository for representation engineering in large language models ☆148 · Nov 14, 2024 · Updated last year