ColinLu50 / SafeDeltaLinks
The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.
☆56Updated 6 months ago
Alternatives and similar repositories for SafeDelta
Users that are interested in SafeDelta are comparing it to the libraries listed below
Sorting:
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆73Updated last month
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆42Updated last month
- ☆84Updated 6 months ago
- ☆164Updated last month
- [ACM MM 2025] Uni-Layout: Integrating Human Feedback in Unified Layout Generation and Evaluation☆44Updated last month
- ☆63Updated 4 months ago
- A collection of papers related to knowledge fusion☆59Updated last year
- [NeurIPS 2025] DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆73Updated 3 months ago
- ☆139Updated this week
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 10 months ago
- ☆157Updated last month
- ☆73Updated 4 months ago
- Useful suggestions for undergraduates in Artificial Intelligence school of BNU☆88Updated last week
- toolkit for WakenLLM framework☆47Updated last month
- ☆44Updated 8 months ago
- Official Code of Logits-Based-Finetuning☆91Updated 6 months ago
- Code for ICCV 2025 paper - Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-T…☆105Updated 2 months ago
- ☆116Updated 4 months ago
- Code of "DrVideo: Document Retrieval Based Long Video Understanding"☆97Updated 4 months ago
- ☆117Updated 3 months ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆42Updated last month
- ☆62Updated last year
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆86Updated 6 months ago
- Quick start with just one Python file for writing large models. No complex file structure or unnecessary explanations, perfect for beginn…☆41Updated 5 months ago
- [ICLR'2025 Spotlight] Official repository for "SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding"☆76Updated last month
- A high-performance Swift wrapper for MaxMind's GeoIP2 databases, offering thread-safe IP geolocation lookups with optimized memory manage…☆101Updated 7 months ago
- simple web ui to manage mcp (model context protocol) servers in the claude app☆104Updated 7 months ago
- ☆134Updated 10 months ago
- HACAN: Hybrid Attention-Driven Cross-Layer Alignment Network for Image-Text Retrieval☆79Updated 8 months ago
- ☆122Updated last year