[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆17Dec 14, 2025Updated 2 months ago
Alternatives and similar repositories for LookAheadTuning
Users that are interested in LookAheadTuning are comparing it to the libraries listed below
- Implementation of AdaCQR(COLING 2025)☆13Dec 30, 2024Updated last year
- Create, Evaluate, and Connect AI Skills☆61Updated this week
- [WWW 2026] BaiJia: An Open Role-Playing Platform of Chinese Historical Characters☆25Jan 14, 2026Updated last month
- Source code for the paper "LongGenBench: Long-context Generation Benchmark"☆24Oct 8, 2024Updated last year
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34May 28, 2025Updated 9 months ago
- Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation (NeurIPS 2022)☆33Dec 16, 2022Updated 3 years ago
- OceanGym: A Benchmark Environment for Underwater Embodied Agents☆95Jan 29, 2026Updated last month
- ☆52Oct 23, 2023Updated 2 years ago
- Identification of the Adversary from a Single Adversarial Example (ICML 2023)☆10Jul 15, 2024Updated last year
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated last year
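- OCR license plate recognition implemented with Python + OpenCV, supporting real-time plate recognition plus plate monitoring and alerting☆10Jul 5, 2020Updated 5 years ago
- Implementation of Eulerian video magnification, applied to heart-rate detection and similar tasks☆12Jul 30, 2018Updated 7 years ago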
- [EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models☆47Jan 23, 2025Updated last year
- ☆14Feb 26, 2025Updated last year
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- An ethnic culture dataset for large language models☆12May 26, 2025Updated 9 months ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 3 weeks ago
- ☆19May 14, 2025Updated 9 months ago
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- ☆11Jan 19, 2025Updated last year
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆18Nov 17, 2025Updated 3 months ago
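- One of the programming test problems from the 2017 Nanjing University computer science summer camp for recommended graduate admission☆12Sep 24, 2017Updated 8 years ago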
- ☆16Apr 7, 2025Updated 10 months ago
- MediaPipe-based gesture control in Android Studio☆10Mar 11, 2020Updated 5 years ago
- [ICLR 2026] "When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms"☆26Feb 3, 2026Updated 3 weeks ago
- Collected lab reports and code for the information security major at Harbin Institute of Technology☆15Dec 23, 2018Updated 7 years ago
- [NeurIPS'24] "NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes"☆10Sep 18, 2025Updated 5 months ago
- [XLLM@ACL2025] Official Code for "Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation"☆22Jul 29, 2025Updated 7 months ago
- ☆16Mar 22, 2025Updated 11 months ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- ☆10Nov 15, 2020Updated 5 years ago
- Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning☆31Jan 14, 2026Updated last month
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆59Oct 28, 2025Updated 4 months ago
- Benchmarking Large Language Models' Resistance to Malicious Code☆14Dec 1, 2024Updated last year
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 9 months ago
- ☆16Oct 11, 2025Updated 4 months ago
- The first toolkit for MLRM safety evaluation, providing unified interface for mainstream models, datasets, and jailbreaking methods!☆14Apr 8, 2025Updated 10 months ago