zjunlp / steer-target-atoms
[ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
☆35 · Jun 4, 2025 · Updated 8 months ago
Alternatives and similar repositories for steer-target-atoms
Users who are interested in steer-target-atoms are comparing it to the repositories listed below.
- The PyTorch code for paper: "CONSK-GCN: Conversational Semantic- and Knowledge-Oriented Graph Convolutional Network for Multimodal Emotio… ☆13 · Oct 21, 2022 · Updated 3 years ago
- ☆25 · Oct 22, 2025 · Updated 3 months ago
- ☆18 · Aug 19, 2024 · Updated last year
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing ☆21 · Mar 18, 2025 · Updated 10 months ago
- Code for our NAACL2025 accepted paper: Attention Tracker: Detecting Prompt Injection Attacks in LLMs ☆20 · Sep 19, 2025 · Updated 4 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 · Feb 9, 2025 · Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc… ☆30 · Jan 28, 2026 · Updated 2 weeks ago
- ☆28 · May 24, 2025 · Updated 8 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent. ☆31 · Aug 15, 2024 · Updated last year
- Codebase for Instruction Following without Instruction Tuning ☆36 · Sep 24, 2024 · Updated last year
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization" ☆84 · Jul 24, 2025 · Updated 6 months ago
- The code for LaRA Benchmark ☆47 · May 28, 2025 · Updated 8 months ago
- ☆43 · Nov 1, 2024 · Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆39 · Jul 18, 2025 · Updated 6 months ago
- ☆18 · Feb 16, 2025 · Updated 11 months ago
- ☆11 · Oct 31, 2024 · Updated last year
- ☆11 · Aug 20, 2025 · Updated 5 months ago
- ☆35 · Mar 25, 2024 · Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark ☆11 · Mar 27, 2025 · Updated 10 months ago
- ☆22 · Dec 11, 2025 · Updated 2 months ago
- ☆12 · Jun 4, 2023 · Updated 2 years ago
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference. ☆46 · Jun 11, 2025 · Updated 8 months ago
- [2025 Shanghai AI Laboratory InternLM Training Camp: top-ten and outstanding projects] ☆43 · Sep 22, 2025 · Updated 4 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆46 · Aug 13, 2025 · Updated 6 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations ☆31 · Oct 15, 2025 · Updated 4 months ago
- Solana Leader TPU List and Event Stream ☆32 · Feb 3, 2026 · Updated last week
- Long Context Research ☆26 · Jan 26, 2026 · Updated 2 weeks ago
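- A collection of quick-start guides for cryptocurrency newcomers, including the most comprehensive collection of crypto and blockchain resources, a directory of tools, a quick introduction to common crypto terms and jargon, and detailed anti-scam guides to help you avoid risks ☆19 · Nov 6, 2025 · Updated 3 months ago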
- Generator for anechoic, non-stationary noise signals ☆11 · Aug 12, 2022 · Updated 3 years ago
- A set of general utilities for Stitches. ☆13 · Jun 3, 2022 · Updated 3 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025) ☆11 · Apr 18, 2025 · Updated 9 months ago
- ES module distribution for stdlib, a standard library for JavaScript and Node.js. ☆13 · Jan 13, 2021 · Updated 5 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 · Oct 11, 2024 · Updated last year
- An open-source Agent Skill framework implementing progressive disclosure architecture ☆40 · Jan 30, 2026 · Updated 2 weeks ago
- ☆14 · Jan 24, 2025 · Updated last year
- ☆13 · May 21, 2023 · Updated 2 years ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 6 months ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization ☆13 · Mar 20, 2025 · Updated 10 months ago
- ☆46 · Jun 11, 2025 · Updated 8 months ago