A method for steering llms to better follow instructions
☆91Jun 10, 2026Updated this week
Alternatives and similar repositories for llm-steer-instruct
Users that are interested in llm-steer-instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆81Jan 16, 2026Updated 4 months ago
- ☆21May 14, 2026Updated last month
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆32Jul 31, 2025Updated 10 months ago
- Github Repo for ICML 2022 paper: Communication-Efficient Adaptive Federated Learning☆10Nov 18, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- This is an arbitrage scanner and auto trading bot created using python code, this bot works for Bybit, Binance, Kucoin, OKX and Bitget, y…☆10Apr 27, 2024Updated 2 years ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆25Jul 22, 2024Updated last year
- A specification for DID create/update/deactivate operations.☆11Jan 3, 2025Updated last year
- ☆83Updated this week
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- ☆27Aug 8, 2025Updated 10 months ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year
- A template ReadMe, best-practices, conventions and other resources for azd (Azure Developer CLI) templates.☆21Sep 10, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Train transformer language models with reinforcement learning.☆19Feb 25, 2025Updated last year
- ☆12Feb 23, 2025Updated last year
- SimKO: Simple Pass@K Policy Optimization☆30Oct 24, 2025Updated 7 months ago
- ☆60Jun 7, 2026Updated last week
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆19May 27, 2025Updated last year
- (SIGIR 25) Repo for "Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation"☆11Jan 18, 2025Updated last year
- ☆17Jun 10, 2025Updated last year
- auto ticket reservation program (python)☆14Jan 28, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- ReMe: A Personalized Cognitive Training Framework Based on an LLM Voice Chatbot for Research☆17Jul 3, 2025Updated 11 months ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated 2 years ago
- Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024☆14Aug 1, 2025Updated 10 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Dec 21, 2023Updated 2 years ago
- ☆11Mar 5, 2025Updated last year
- Tayra is a sophisticated call center analytics platform designed to systematically evaluate and score call center audio interactions. By …☆14Dec 19, 2025Updated 5 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14Feb 2, 2019Updated 7 years ago
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation☆58Feb 2, 2026Updated 4 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆151Feb 21, 2025Updated last year
- ☆25Jun 10, 2025Updated last year
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆18Aug 15, 2025Updated 9 months ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆19Nov 25, 2025Updated 6 months ago
- Linux installation guide for the 2024 Asus G14.☆15Dec 6, 2024Updated last year