tml-epfl / icl-alignmentLinks

Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]

☆31

Alternatives and similar repositories for icl-alignment

Users that are interested in icl-alignment are comparing it to the libraries listed below

Sorting:

sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆44Updated 6 months ago
googleinterns / localizing-paragraph-memorization
☆15Updated last year
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47Updated 7 months ago
milesaturpin / cot-unfaithfulness
☆48Updated 2 years ago
tml-epfl / long-is-more-for-alignment
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]
☆19Updated last year
qcznlp / uncertainty_attack
☆21Updated last month
formll / resolving-scaling-law-discrepancies
☆20Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆26Updated 10 months ago
XiangLi1999 / AutoBencher
☆32Updated last year
abertsch72 / long-context-icl
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆40Updated last year
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆19Updated last year
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆29Updated last year
GXimingLu / IPA
Codebase for Inference-Time Policy Adapters
☆24Updated last year
jiahai-feng / binding-iclr
☆16Updated last year
shadowkiller33 / Contrast-Instruction
☆19Updated 2 years ago
matchten / LoRA-Models-for-SAEs
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17Updated 6 months ago
hbin0701 / Self-Explore
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆51Updated last year
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year
activatedgeek / calibration-tuning
☆52Updated 6 months ago
MadryLab / DsDm
☆50Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36Updated last year
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆71Updated last year
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Updated 5 months ago
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆92Updated 11 months ago
yihuaihong / ConceptVectors
[EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"
☆35Updated 2 months ago
eth-lre / LLM_ICL
ACL24
☆10Updated last year
Nanami18 / Snowballed_Hallucination
☆44Updated last year
tatsu-lab / test_set_contamination
☆41Updated last year
Pranjal2041 / AdaptiveConsistency
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
☆39Updated last year