An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.
☆24Feb 20, 2025Updated last year
Alternatives and similar repositories for SEAL
Users that are interested in SEAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Feb 13, 2023Updated 3 years ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆26Sep 10, 2024Updated last year
- Implement of our TKDE paper: Hyperbolic Graph Learning for Social Recommendation☆13Jun 3, 2024Updated last year
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Dec 26, 2024Updated last year
- Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning☆33Feb 7, 2023Updated 3 years ago
- ☆14Oct 7, 2023Updated 2 years ago
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting☆24Jul 30, 2024Updated last year
- This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…☆36Mar 22, 2025Updated last year
- Rad-cGAN v1.0: Radar-based precipitation nowcasting model with conditional Generative Adversarial Networks for multiple dam domains☆11Jul 22, 2022Updated 3 years ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆39Jan 13, 2025Updated last year
- Source code and dataset of the paper "Modality-Independent Graph Neural Networks with Global Transformers for Multimodal Recommendation",…☆52May 13, 2025Updated 11 months ago
- ☆21Oct 25, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆52Nov 10, 2024Updated last year
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…☆10Jul 21, 2023Updated 2 years ago
- ☆44Oct 1, 2024Updated last year
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- ☆34Jan 15, 2026Updated 3 months ago
- This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)☆49Jan 15, 2026Updated 3 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- [NeurIPS2024] Official Codes of the Paper "Gradient Guidance for Diffusion Models: An Optimization Perspective"☆25Mar 21, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆55Feb 2, 2025Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"☆66Jun 9, 2025Updated 10 months ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆17Oct 31, 2024Updated last year
- Code Repository for NeurIPS 2021 accepted paper, named "Torwards Gradient-based Bilevel Optimization with non-convex Followers and Beyond…☆11Mar 28, 2022Updated 4 years ago
- This repository is the official implementation of the source code of the paper "B2Opt: Learning to Optimize Black-box Optimization with L…☆11Aug 16, 2024Updated last year
- ☆10Feb 6, 2025Updated last year
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 5 years ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆81Sep 28, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A comprehensive tool designed to enhance the retrieval and generation of academic content from the arXiv database, leveraging advanced Re…☆13Dec 30, 2024Updated last year
- ☆17Jun 11, 2025Updated 10 months ago
- [ACMMM 2024] Official PyTorch implementation for "Enhancing Images with Coupled Low-Resolution and Ultra-Dark Degradations: A Tri-level L…☆11Oct 10, 2024Updated last year
- Code for ICML 2023 paper named "Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity"☆14Jan 14, 2025Updated last year
- Official code for PLoP☆18Mar 6, 2026Updated last month
- 完整的 scrapy 爬虫示例,爬取股票和新闻数据☆15Aug 15, 2020Updated 5 years ago
- Upscaling satellite imagery using GAN by 4 times☆16Mar 14, 2022Updated 4 years ago