An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.
☆23 · Feb 20, 2025 · Updated last year
Alternatives and similar repositories for SEAL
Users that are interested in SEAL are comparing it to the libraries listed below
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods. ☆19 · Feb 13, 2023 · Updated 3 years ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS 2024) ☆26 · Sep 10, 2024 · Updated last year
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie… ☆10 · Feb 7, 2026 · Updated 3 weeks ago
- Implementation of our TKDE paper: Hyperbolic Graph Learning for Social Recommendation ☆13 · Jun 3, 2024 · Updated last year
- ☆21 · Dec 26, 2024 · Updated last year
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting ☆24 · Jul 30, 2024 · Updated last year
- This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba… ☆36 · Mar 22, 2025 · Updated 11 months ago
- TensorFlow implementation of our SIGIR 2023 accepted paper "Generative-Contrastive Graph Learning for Recommendation" ☆31 · Aug 26, 2024 · Updated last year
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language … ☆38 · Jan 13, 2025 · Updated last year
- Rad-cGAN v1.0: Radar-based precipitation nowcasting model with conditional Generative Adversarial Networks for multiple dam domains ☆11 · Jul 22, 2022 · Updated 3 years ago
- ☆14 · Feb 5, 2025 · Updated last year
- Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning ☆33 · Feb 7, 2023 · Updated 3 years ago
- Official code for PLoP ☆17 · Jun 30, 2025 · Updated 8 months ago
- 🌿 Quickly generates a folder directory structure, with support for setting the directory depth and for writing the output to a markdown file. ☆13 · Oct 19, 2022 · Updated 3 years ago
- ☆20 · Dec 29, 2025 · Updated 2 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning" ☆51 · Nov 10, 2024 · Updated last year
- Simple Python Socket-based Split Learning technique using PyTorch ☆13 · Mar 13, 2020 · Updated 5 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance. ☆10 · Sep 21, 2022 · Updated 3 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion" ☆14 · Mar 28, 2024 · Updated last year
- Source code and dataset of the paper "Modality-Independent Graph Neural Networks with Global Transformers for Multimodal Recommendation",… ☆51 · May 13, 2025 · Updated 9 months ago
- This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS 2024) ☆49 · Jan 15, 2026 · Updated last month
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library. ☆13 · Jan 9, 2024 · Updated 2 years ago
- ☆10 · Feb 6, 2025 · Updated last year
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation" ☆54 · Feb 2, 2025 · Updated last year
- ☆14 · Oct 7, 2023 · Updated 2 years ago
- ☆11 · Mar 12, 2024 · Updated last year
- Source code for a LoRA-based continual relation extraction method. ☆14 · Sep 25, 2023 · Updated 2 years ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering ☆16 · Oct 31, 2024 · Updated last year
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs" ☆67 · Jun 9, 2025 · Updated 8 months ago
- DNN_Partition helper tool for simple performance analysis of PyTorch models, with support for model partitioning ☆14 · May 31, 2021 · Updated 4 years ago
- Implementation of SIGIR'25 accepted paper, focusing on social denoising recommendation ☆13 · Apr 8, 2025 · Updated 10 months ago
- The source code of Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents. ☆34 · Jan 31, 2026 · Updated last month
- Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation ☆12 · May 16, 2023 · Updated 2 years ago
- A comprehensive tool designed to enhance the retrieval and generation of academic content from the arXiv database, leveraging advanced Re… ☆13 · Dec 30, 2024 · Updated last year
- ☆29 · Jan 7, 2026 · Updated last month
- [NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models". ☆16 · Dec 15, 2025 · Updated 2 months ago
- Official This-Is-My Dataset published in CVPR 2023 ☆16 · Jul 18, 2024 · Updated last year
- The official implementation of InvRL ☆13 · Oct 19, 2022 · Updated 3 years ago
- Code for the paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025) ☆22 · Apr 26, 2025 · Updated 10 months ago