[NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality
☆20Oct 22, 2025Updated 5 months ago
Alternatives and similar repositories for RULE-Unlearn
Users who are interested in RULE-Unlearn are comparing it to the libraries listed below.
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 8 months ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆50Dec 23, 2025Updated 3 months ago
- ☆40Dec 16, 2025Updated 3 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆24Aug 25, 2025Updated 6 months ago
- A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing. EMNLP 2022☆11Feb 1, 2023Updated 3 years ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Jun 23, 2025Updated 8 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆69May 13, 2025Updated 10 months ago
- generative models on toys☆12Sep 10, 2024Updated last year
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated last year
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆15Nov 25, 2025Updated 3 months ago
- ☆19Sep 8, 2025Updated 6 months ago
- Research Diary System - LaTeX-based academic diary with PDF/HTML compilation☆31Sep 29, 2025Updated 5 months ago
- ☆10Oct 29, 2020Updated 5 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Project page for continual diffusion pre-print paper☆12May 2, 2024Updated last year
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts☆38Sep 26, 2024Updated last year
- Accurately and reliably defining organs at risk (OARs) and tumors are the cornerstone of radiation therapy (RT) treatment planning for lu…☆12Jul 19, 2023Updated 2 years ago
- ☆19May 3, 2025Updated 10 months ago
- Influence Maximization Paper List☆11May 11, 2022Updated 3 years ago
- Implementation of ENAS for CNNs on CIFAR 10☆11Oct 13, 2019Updated 6 years ago
- CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)☆17Feb 10, 2025Updated last year
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Jun 10, 2024Updated last year
- [MICCAI-FLARE2022] Combining Self-Training and Hybrid Architecture for Semi-supervised Abdominal Organ Segmentation☆11Aug 24, 2022Updated 3 years ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Feb 18, 2026Updated last month
- ☆25Jun 17, 2025Updated 9 months ago
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- ☆15Feb 26, 2025Updated last year
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆24Jul 31, 2025Updated 7 months ago
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated 2 months ago
- Ensemble Learning of Foundation Models☆17Aug 29, 2025Updated 6 months ago
- Structural Deep Clustering Network☆13Apr 27, 2020Updated 5 years ago
- ☆11Oct 27, 2019Updated 6 years ago