[ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
☆41Jun 4, 2025Updated 11 months ago
Alternatives and similar repositories for steer-target-atoms
Users that are interested in steer-target-atoms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for our NAACL2025 accepted paper: Attention Tracker: Detecting Prompt Injection Attacks in LLMs☆23Sep 19, 2025Updated 8 months ago
- ☆18Aug 19, 2024Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆35Mar 9, 2026Updated 2 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 6 months ago
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆96May 6, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆39Dec 19, 2025Updated 5 months ago
- ☆10Oct 15, 2019Updated 6 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated 2 years ago
- Generator for anechoic, non-stationary noise signals☆11Aug 12, 2022Updated 3 years ago
- ☆17May 31, 2024Updated last year
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- ☆12Aug 8, 2023Updated 2 years ago
- Pytorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me…☆11Jan 31, 2021Updated 5 years ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆39Oct 15, 2025Updated 7 months ago
- A benchmark for evaluating the efficiency of LLM-generated code☆17Apr 17, 2025Updated last year
- Papers and Related work to help learn ICL conveniently for everyone who interests.☆14Feb 28, 2024Updated 2 years ago
- [2025-上海人工智能实验室书生实训 营十佳、优秀项目]☆43Sep 22, 2025Updated 8 months ago
- ☆19Aug 4, 2025Updated 9 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17Dec 25, 2023Updated 2 years ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""☆35Oct 12, 2025Updated 7 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 4 months ago
- ☆20Jan 5, 2023Updated 3 years ago
- A Multi-task learning framework for personality trait detection☆11Jan 13, 2021Updated 5 years ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆40Jul 18, 2025Updated 10 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆120Oct 23, 2023Updated 2 years ago
- Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization☆12Jul 15, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A simple Python script to convert FOA audio to binaural.☆16Nov 29, 2022Updated 3 years ago
- An automated data pipeline scaling RL to pretraining levels☆77Oct 11, 2025Updated 7 months ago
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆16Feb 27, 2025Updated last year
- A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple …☆15Mar 30, 2023Updated 3 years ago
- SpectraGuru - A Spectra Analysis Application☆35May 15, 2026Updated last week
- ☆28Oct 28, 2024Updated last year
- [ACL 2025] The official implementation of the paper "PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free".☆76Dec 4, 2025Updated 5 months ago