jiahai-feng / binding-iclr
☆12Updated last year
Alternatives and similar repositories for binding-iclr:
Users that are interested in binding-iclr are comparing it to the libraries listed below
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆34Updated 4 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆29Updated 2 months ago
- Self-Supervised Alignment with Mutual Information☆16Updated 9 months ago
- ☆18Updated 8 months ago
- ☆30Updated 2 months ago
- ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"☆33Updated last month
- ☆26Updated 3 weeks ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆44Updated last month
- ☆18Updated 4 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆47Updated 2 weeks ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆11Updated 7 months ago
- ☆38Updated last year
- ☆15Updated 6 months ago
- ☆30Updated 3 months ago
- Augmenting Statistical Models with Natural Language Parameters☆23Updated 6 months ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆28Updated last year
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆26Updated last month
- The repository contains code for Adaptive Data Optimization☆20Updated 3 months ago
- ☆37Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆15Updated 7 months ago
- ☆28Updated 8 months ago
- Codebase for Inference-Time Policy Adapters☆23Updated last year
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆74Updated 5 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- Code for "Universal Adversarial Triggers Are Not Universal."☆16Updated 10 months ago