A method for steering llms to better follow instructions
☆83Aug 6, 2025Updated 7 months ago
Alternatives and similar repositories for llm-steer-instruct
Users that are interested in llm-steer-instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19May 3, 2025Updated 10 months ago
- ☆25Dec 12, 2025Updated 3 months ago
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆14Mar 14, 2026Updated last week
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆25Jul 31, 2025Updated 7 months ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Jun 19, 2023Updated 2 years ago
- How to use OpenAI API?☆12Nov 23, 2023Updated 2 years ago
- [EMNLP-2025] R1-Zero on ANY TASK☆30Nov 9, 2025Updated 4 months ago
- ☆23Dec 5, 2025Updated 3 months ago
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- ☆22Sep 5, 2025Updated 6 months ago
- [ICLR 2025] General-purpose activation steering library☆150Sep 18, 2025Updated 6 months ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- ☆54Jul 31, 2024Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆25Jul 22, 2024Updated last year
- ☆18Aug 18, 2024Updated last year
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- ☆22Aug 8, 2025Updated 7 months ago
- ☆16May 18, 2023Updated 2 years ago
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆31Jan 30, 2026Updated last month
- ☆61Sep 18, 2025Updated 6 months ago
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆15Nov 25, 2025Updated 3 months ago
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆20May 27, 2025Updated 9 months ago
- (SIGIR 25) Repo for "Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation"☆10Jan 18, 2025Updated last year
- Generative AI Operations Solution Accelerator☆84Nov 5, 2025Updated 4 months ago
- ☆16Jun 10, 2025Updated 9 months ago
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- ReMe: A Personalized Cognitive Training Framework Based on an LLM Voice Chatbot for Research☆17Jul 3, 2025Updated 8 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Dec 21, 2023Updated 2 years ago
- Artificial Intelligence for Cybersecurity, published by Packt☆24Mar 2, 2026Updated 3 weeks ago
- ☆11Mar 5, 2025Updated last year
- ☆14Jun 19, 2023Updated 2 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆16Aug 15, 2025Updated 7 months ago
- Molecule QEMU driver for testing Ansible roles☆13Oct 28, 2024Updated last year
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- ☆64Oct 8, 2024Updated last year