[ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
☆39 · Jun 4, 2025 · Updated 10 months ago
Alternatives and similar repositories for steer-target-atoms
Users interested in steer-target-atoms are comparing it to the libraries listed below.
- ☆18 · Aug 19, 2024 · Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc… ☆35 · Mar 9, 2026 · Updated last month
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization" ☆95 · Apr 8, 2026 · Updated last week
- ☆32 · Oct 22, 2025 · Updated 5 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 · Feb 9, 2025 · Updated last year
- ☆10 · Oct 15, 2019 · Updated 6 years ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a… ☆86 · Sep 1, 2025 · Updated 7 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆20 · Apr 9, 2025 · Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data ☆14 · Feb 26, 2025 · Updated last year
- Implementation code for ACL 2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing ☆15 · Apr 20, 2024 · Updated last year
- Generator for anechoic, non-stationary noise signals ☆11 · Aug 12, 2022 · Updated 3 years ago
- [WIP] Direction-based Multi-Channel Speech Separation ☆14 · Jan 25, 2024 · Updated 2 years ago
- ☆12 · Aug 8, 2023 · Updated 2 years ago
- PyTorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me… ☆11 · Jan 31, 2021 · Updated 5 years ago
- A benchmark for evaluating the efficiency of LLM-generated code ☆17 · Apr 17, 2025 · Updated 11 months ago
- Sound field reconstruction using neural processes with dynamic kernels ☆16 · Mar 25, 2025 · Updated last year
- Code for DOMI ☆11 · Mar 24, 2023 · Updated 3 years ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing ☆21 · Mar 18, 2025 · Updated last year
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs" ☆33 · Oct 12, 2025 · Updated 6 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization" ☆31 · Jan 10, 2026 · Updated 3 months ago
- A multi-task learning framework for personality trait detection ☆11 · Jan 13, 2021 · Updated 5 years ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆40 · Jul 18, 2025 · Updated 8 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆53 · Aug 10, 2025 · Updated 8 months ago
- A toolkit for researchers in multimodal sound separation. ☆16 · Oct 20, 2023 · Updated 2 years ago
- ☆17 · Apr 17, 2022 · Updated 3 years ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24… ☆26 · Jun 14, 2024 · Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond" ☆17 · Feb 27, 2025 · Updated last year
- SpectraGuru - A Spectra Analysis Application ☆34 · Updated this week
- ☆28 · Oct 28, 2024 · Updated last year
- Binaural Spatializer Audio Plugin ☆23 · Jun 25, 2024 · Updated last year
- ☆18 · Oct 8, 2024 · Updated last year
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time" ☆31 · Jul 20, 2025 · Updated 8 months ago
- Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation ☆19 · Aug 26, 2023 · Updated 2 years ago
- Speaker adaptive forced alignment (phonetic segmentation) using Wav2Vec2 ☆23 · Mar 5, 2026 · Updated last month
- Bridging Immutable and Mutable Abstractions for Distributed Data Analytics ☆12 · May 15, 2019 · Updated 6 years ago
- AlphaVerus: Formally Verified Code Generation through Self-Improving Translation and Treefinement ☆29 · May 14, 2025 · Updated 11 months ago
- ☆15 · Mar 7, 2020 · Updated 6 years ago
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI 2026 Oral ☆51 · Jan 16, 2026 · Updated 3 months ago
- ☆20 · Apr 25, 2020 · Updated 5 years ago