2003pro / ScaleBiOLinks

This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting

☆22

Alternatives and similar repositories for ScaleBiO

Users that are interested in ScaleBiO are comparing it to the libraries listed below

Sorting:

abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51Updated last year
deeplearning-wisc / args
☆46Updated last year
hkust-nlp / PEM_composition
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61Updated last year
junkangwu / beta-DPO
[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
☆49Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆67Updated 11 months ago
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆84Updated 8 months ago
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆55Updated last year
VITA-Group / SEAL
[COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆45Updated 7 months ago
dannyallover / overthinking_the_truth
☆29Updated last year
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆87Updated last week
roeehendel / icl_task_vectors
☆101Updated 2 years ago
decoding-comp-trust / comp-trust
Codebase for decoding compressed trust.
☆25Updated last year
logix-project / logix
AI Logging for Interpretability and Explainability🔬
☆133Updated last year
tatsu-lab / test_set_contamination
☆41Updated 2 years ago
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆69Updated 3 years ago
OpenBMB / CPO
☆23Updated last year
cxcscmu / MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆76Updated last year
shizhediao / Black-Box-Prompt-Learning
Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"
☆56Updated 2 years ago
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆82Updated 11 months ago
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆118Updated last year
RZFan525 / Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
☆79Updated 2 years ago
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆62Updated last year
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆19Updated 2 years ago
glorgao / SelectiveDPO
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
☆44Updated 4 months ago
srzer / MOD
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆28Updated last year
ZHZisZZ / modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
☆92Updated last year
ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17Updated last year
zjysteven / mink-plus-plus
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆50Updated 5 months ago
Zhou-Zoey / RMB-Reward-Model-Benchmark
☆45Updated 7 months ago
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆75Updated last year