☆13Jan 14, 2025Updated last year
Alternatives and similar repositories for safety-arithmetic
Users that are interested in safety-arithmetic are comparing it to the libraries listed below
Sorting:
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- ☆20Aug 8, 2025Updated 6 months ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Vite + Mantine + Vanilla extract template☆12Feb 16, 2026Updated 2 weeks ago
- A simple web-app for generating glassmorphism UI effect!☆12Aug 5, 2023Updated 2 years ago
- ☆18Jan 20, 2026Updated last month
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 4 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 5 months ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- Deepseek-CoT☆10Oct 6, 2024Updated last year
- Download TikTok videos online with TikTok Video Downloader. Completely free.☆13Sep 17, 2025Updated 5 months ago
- Eval LLMs☆11May 12, 2024Updated last year
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning☆10Oct 22, 2022Updated 3 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- a neural network trainer for weebs☆14Updated this week
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- Links to publications that focus on the interpretation and analysis of in-context learning☆15Oct 17, 2024Updated last year
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆13Jan 7, 2025Updated last year
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- ☆12Dec 21, 2025Updated 2 months ago
- ☆26Jun 28, 2025Updated 8 months ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 2 months ago
- A copy of the DirectX Headers from MinGW-64.☆13Sep 7, 2023Updated 2 years ago
- ☆11Jan 24, 2022Updated 4 years ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Official Implementation of Geo2Vec oral presented @ [AAAI '2026]☆31Nov 22, 2025Updated 3 months ago
- An application that brings together several anime streaming platforms☆10Mar 1, 2025Updated last year
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- The GIF-to-Chatter app you didn't know you needed!☆15Feb 12, 2022Updated 4 years ago
- This repository contains a curated list of resources related to World Models for Autonomous Driving (WMAD), based on the survey.☆27Oct 10, 2025Updated 4 months ago
- ☆16Jan 7, 2025Updated last year
- LLM手撕代码合集☆19Mar 25, 2025Updated 11 months ago
- ☆13Jan 15, 2025Updated last year
- ☆12Feb 19, 2024Updated 2 years ago
- Text summation using python, deep learning, machine learning, transformer, huggingface, openai and langchain☆13Nov 26, 2024Updated last year