A method for steering llms to better follow instructions
☆82Aug 6, 2025Updated 6 months ago
Alternatives and similar repositories for llm-steer-instruct
Users that are interested in llm-steer-instruct are comparing it to the libraries listed below
Sorting:
- ☆25Dec 12, 2025Updated 2 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆73Jan 16, 2026Updated last month
- ☆18May 3, 2025Updated 10 months ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition☆19Nov 25, 2024Updated last year
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆26Jul 31, 2025Updated 7 months ago
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆20May 27, 2025Updated 9 months ago
- ☆17Oct 11, 2022Updated 3 years ago
- [EMNLP-2025] R1-Zero on ANY TASK☆27Nov 9, 2025Updated 3 months ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21May 18, 2024Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆25Jul 22, 2024Updated last year
- ☆25Jun 10, 2025Updated 8 months ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Dec 21, 2023Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆30Mar 26, 2024Updated last year
- quick playground to animate pippin☆14Nov 11, 2024Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆31Aug 15, 2024Updated last year
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 10 months ago
- A red teaming agent☆18Oct 15, 2025Updated 4 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆62Oct 24, 2025Updated 4 months ago
- ☆95Dec 19, 2025Updated 2 months ago
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 2 years ago
- ☆32Oct 18, 2024Updated last year
- Agent Innovator Lab – building AI agents on Azure, covering search optimization, agent design, evaluation, and RAG best practices.☆52Feb 20, 2026Updated last week
- A holistic benchmark for LLM abstention☆71Aug 27, 2025Updated 6 months ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)☆142Sep 21, 2024Updated last year
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆217Jun 23, 2025Updated 8 months ago
- ☆52Jul 31, 2024Updated last year
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 8 months ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis…☆14Oct 21, 2024Updated last year
- Resources and examples for creating Co-Pilot applications that use modern AI to assist with complex tasks.☆13Aug 27, 2025Updated 6 months ago
- Make Apps with ChatGPT and Generative AI, by Packt Publishing☆11Jul 25, 2025Updated 7 months ago
- A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system.☆12Jul 11, 2023Updated 2 years ago
- ☆10Nov 7, 2022Updated 3 years ago
- Application for Agent re-engineering for better and reliable Gen AI workflows.☆10Jul 20, 2025Updated 7 months ago
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation"☆15Jul 4, 2025Updated 8 months ago