[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆17Dec 14, 2025Updated 3 months ago
Alternatives and similar repositories for LookAheadTuning
Users that are interested in LookAheadTuning are comparing it to the libraries listed below
Sorting:
- Implementation of AdaCQR(COLING 2025)☆13Dec 30, 2024Updated last year
- [WWW 2026] BaiJia: An Open Role-Playing Platform of Chinese Historical Characters☆25Jan 14, 2026Updated 2 months ago
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆19Nov 17, 2025Updated 4 months ago
- Source code for the paper "LongGenBench: Long-context Generation Benchmark"☆23Oct 8, 2024Updated last year
- OceanGym: A Benchmark Environment for Underwater Embodied Agents☆100Jan 29, 2026Updated last month
- [EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models☆47Jan 23, 2025Updated last year
- enchmarking Large Language Models' Resistance to Malicious Code☆14Dec 1, 2024Updated last year
- 面向大模型的民族文化数据集☆12May 26, 2025Updated 9 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34May 28, 2025Updated 9 months ago
- Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation (NeurIPS 2022)☆33Dec 16, 2022Updated 3 years ago
- 实现欧拉视频放大 并用于心率检测等☆12Jul 30, 2018Updated 7 years ago
- ☆52Oct 23, 2023Updated 2 years ago
- ☆14Feb 26, 2025Updated last year
- Android Studio基于mediapipe的手势控制☆10Mar 11, 2020Updated 6 years ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- Identification of the Adversary from a Single Adversarial Example (ICML 2023)☆10Jul 15, 2024Updated last year
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- ☆19May 14, 2025Updated 10 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last month
- [NeurIPS'24] "NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes"☆10Sep 18, 2025Updated 6 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆51Oct 18, 2024Updated last year
- RestContent is a Headless CMS written in Go+Alpine, supports multiple sites, media libraries, and multiple users, and provides content ma…☆13Jan 23, 2024Updated 2 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- ☆39Sep 13, 2025Updated 6 months ago
- 2017年南京大学计算机专业保研推免夏令营机试试题之一☆12Sep 24, 2017Updated 8 years ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆11Jun 18, 2024Updated last year
- Python+OpenCV实现OCR车牌识别,能够实现车牌实时识别以及车牌的监测报警功能☆11Jul 5, 2020Updated 5 years ago
- Demo code for the paper: One Thing to Fool them All: Generating Interpretable, Universal, and Physically-Realizable Adversarial Features☆12Nov 30, 2023Updated 2 years ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- [XLLM@ACL2025] Official Code for "Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation"☆23Jul 29, 2025Updated 7 months ago
- 直接在python中使用谷歌mediapipe的手关键点检测模型☆12Jun 19, 2020Updated 5 years ago
- ☆11Jun 20, 2023Updated 2 years ago
- ☆21Aug 2, 2024Updated last year
- ☆18Apr 7, 2025Updated 11 months ago
- [COLING 2025] Official code of the paper "The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models"☆59Dec 26, 2024Updated last year