[ACL2025 Best Paper] Language Models Resist Alignment
☆43Jun 11, 2025Updated 8 months ago
Alternatives and similar repositories for llms-resist-alignment
Users that are interested in llms-resist-alignment are comparing it to the libraries listed below
- ☆15Updated this week
- ☆16Apr 7, 2025Updated 10 months ago
- ☆19Sep 22, 2025Updated 5 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- ☆33Jan 7, 2025Updated last year
- ☆43Feb 9, 2026Updated 3 weeks ago
- QuESt Planning is a long-term power system capacity expansion planning model that identifies cost-optimal energy storage, generation, and…☆14Feb 4, 2026Updated 3 weeks ago
- ☆19Nov 20, 2025Updated 3 months ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- Entry for the 2020 First Hunan Province Artificial Intelligence Competition☆11Feb 17, 2022Updated 4 years ago
- Example Systems using PowerDynamics.jl☆12Oct 10, 2022Updated 3 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele…☆11May 22, 2024Updated last year
- Visualize linear programming at https://lpviz.net☆33Jan 20, 2026Updated last month
- YOLO object detection algorithm☆15Jul 27, 2025Updated 7 months ago
- ☆12Mar 15, 2023Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- ☆16Jan 16, 2025Updated last year
- ☆14May 1, 2023Updated 2 years ago
- ☆14Jan 8, 2026Updated last month
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆25Updated this week
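- DETR tensor: removes the auxiliary heads that are unused during inference, adds fp16 deployment for further acceleration, and offers a new method for solving the all-zero output problem after conversion to TensorRT.☆12Jan 9, 2024Updated 2 years ago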
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆14Jan 5, 2024Updated 2 years ago
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Sep 21, 2022Updated 3 years ago
- ☆10Mar 25, 2024Updated last year
- ☆13Jun 25, 2025Updated 8 months ago
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization☆12Jan 12, 2026Updated last month
- LAMPOS, a strategy-based solution approach for mp-MILPs for real-time mixed-integer MPC with sub-optimality quantification☆11Jun 25, 2023Updated 2 years ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 2 months ago
- ☆11Jan 19, 2025Updated last year
- Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm☆11Nov 12, 2021Updated 4 years ago
- Implementation of "Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes"☆12Oct 2, 2024Updated last year
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- ☆16Jul 7, 2025Updated 7 months ago
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- ☆13Nov 5, 2025Updated 3 months ago