ColinLu50 / SafeDeltaLinks
The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.
☆56Updated 5 months ago
Alternatives and similar repositories for SafeDelta
Users that are interested in SafeDelta are comparing it to the libraries listed below
Sorting:
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆42Updated 3 weeks ago
- ☆164Updated 3 weeks ago
- ☆82Updated 6 months ago
- A collection of papers related to knowledge fusion☆59Updated last year
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆72Updated 3 weeks ago
- [NeurIPS 2025] DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆73Updated 2 months ago
- ☆63Updated 3 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 9 months ago
- ☆44Updated 7 months ago
- Official Code of Logits-Based-Finetuning☆91Updated 5 months ago
- Code of "DrVideo: Document Retrieval Based Long Video Understanding"☆95Updated 3 months ago
- ☆157Updated 3 weeks ago
- AutoRLAIF is a cutting-edge framework designed to revolutionize the fine-tuning of large language models through Reinforcement Learning …☆94Updated last year
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆86Updated 5 months ago
- ☆62Updated last year
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆35Updated last year
- toolkit for WakenLLM framework☆47Updated 2 weeks ago
- ☆116Updated 4 months ago
- ☆139Updated this week
- Code for ICCV 2025 paper - Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-T…☆102Updated last month
- ☆95Updated last week