Jihuai-wpy/InferAligner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jihuai-wpy/InferAligner)

Jihuai-wpy / InferAligner

Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.

☆38

Alternatives and similar repositories for InferAligner

Users that are interested in InferAligner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xinghaow99 / DenoSent
View on GitHub
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15Apr 29, 2024Updated 2 years ago
Twilight92z / Quantize-Watermark
View on GitHub
☆19Nov 6, 2023Updated 2 years ago
xinghaow99 / BitStack
View on GitHub
[ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
☆38Feb 17, 2025Updated last year
0nutation / DUB
View on GitHub
Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)
☆29Jun 28, 2023Updated 3 years ago
fnlp-vision / UnifiedVisual
View on GitHub
Official repository for the EMNLP 2025 paper “UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets”.
☆16Sep 19, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gyhdog99 / ECSO
View on GitHub
ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)
☆37Nov 2, 2024Updated last year
fnlp-vision / DPA
View on GitHub
[EMNLP Findings'25] Official PyTorch Implementation of Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Align…
☆16Sep 19, 2025Updated 10 months ago
xinghaow99 / prism
View on GitHub
[ICML 2026] Prism: Spectral-Aware Block-Sparse Attention
☆27May 22, 2026Updated 2 months ago
0nutation / SLMTokBench
View on GitHub
SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
☆37Aug 29, 2023Updated 2 years ago
xinghaow99 / pbs-attn
View on GitHub
[ICML 2026] Sparser Block-Sparse Attention via Token Permutation
☆31May 22, 2026Updated 2 months ago
OpenLMLab / Sniffer
View on GitHub
☆27Jun 5, 2023Updated 3 years ago
IBM / SafeLoRA
View on GitHub
Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"
☆29Dec 21, 2025Updated 7 months ago
SaFo-Lab / AdaShield
View on GitHub
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆73Feb 9, 2026Updated 5 months ago
uw-nsl / SafeDecoding
View on GitHub
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
☆154Jul 19, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
SolidShen / RIPPLE_official
View on GitHub
☆20Feb 11, 2024Updated 2 years ago
EmbodiedForge / Inspire-cli
View on GitHub
A tool for better use of Inspire platform (Beta: Codeberg version is more up-to-date)
☆28Apr 2, 2026Updated 3 months ago
homles11 / SaLoRA
View on GitHub
Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation(ICLR 2025)”
☆29Oct 23, 2025Updated 9 months ago
isXinLiu / MM-SafetyBench
View on GitHub
Accepted by ECCV 2024
☆218Oct 15, 2024Updated last year
OpenMOSS / claude-codex-handoff
View on GitHub
Drop-in async file-based handoff protocol for two AI coding agents (Claude Code + Codex), installed as one shared .handoff/ in your proje…
☆30Jul 4, 2026Updated 3 weeks ago
ejhshen / SLIM
View on GitHub
Implementation of SLIM, a framework of dynamics skill lifecycle management for agentic reinforcement learning
☆22May 12, 2026Updated 2 months ago
Qinyu-Allen-Zhao / LVLM-LP
View on GitHub
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
☆43Nov 1, 2024Updated last year
0nutation / SpeechAgents
View on GitHub
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
☆87Jan 9, 2024Updated 2 years ago
PKU-YuanGroup / AsFT
View on GitHub
Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".
☆37Jul 10, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
NTU-SQUAD / transformers-coqa
View on GitHub
Albert for Conversational Question Answering Challenge
☆21Jun 12, 2023Updated 3 years ago
wang8740 / MAP
View on GitHub
Documentation at
☆14Mar 27, 2025Updated last year
OpenMOSS / MOSS-VL
View on GitHub
MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆404Updated this week
CyberAgentAILab / filtered-dpo
View on GitHub
[EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by …
☆16Nov 27, 2024Updated last year
BeastyZ / LLM-Verified-Retrieval
View on GitHub
Repo for Llatrieval
☆32Aug 21, 2024Updated last year
euReKa025 / AgentLongBench
View on GitHub
☆22Jan 29, 2026Updated 6 months ago
OpenMOSS / VehicleWorld
View on GitHub
VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …
☆24Sep 16, 2025Updated 10 months ago
ritzz-ai / PACS
View on GitHub
☆31Sep 12, 2025Updated 10 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
DavidFanzz / SCMoE
View on GitHub
☆29May 24, 2024Updated 2 years ago
OpenMOSS / Thus-Spake-Long-Context-LLM
View on GitHub
a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation
☆62Mar 31, 2025Updated last year
ZHZisZZ / modpo
View on GitHub
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
☆101Aug 20, 2024Updated last year
ys-zong / VLGuard
View on GitHub
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.
☆90Jan 19, 2025Updated last year
tongjingqi / Awesome-Agent-RL
View on GitHub
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical …
☆60Sep 1, 2025Updated 10 months ago
Junjie-Ye / ToolSword
View on GitHub
[ACL 2024] ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
☆15Sep 12, 2024Updated last year
PromptLabs / hackaprompt
View on GitHub
☆21Dec 9, 2023Updated 2 years ago