gszfwsb / Data-Whisperer
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".
☆20 · Updated 3 weeks ago
Alternatives and similar repositories for Data-Whisperer
Users who are interested in Data-Whisperer are comparing it to the repositories listed below
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆73 · Updated last week
- awesome SAE papers ☆35 · Updated last month
- A Sober Look at Language Model Reasoning ☆74 · Updated last week
- 📜 Paper list on decoding methods for LLMs and LVLMs ☆49 · Updated last month
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆34 · Updated 5 months ago
- A curated list of resources for activation engineering ☆90 · Updated last month
- ☆222 · Updated last week
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples ☆37 · Updated 2 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg… ☆89 · Updated last week
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free ☆28 · Updated 2 months ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective ☆32 · Updated 4 months ago
- Accepted LLM Papers in NeurIPS 2024 ☆37 · Updated 8 months ago
- ☆139 · Updated last month
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆45 · Updated 8 months ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models ☆16 · Updated this week
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models" ☆25 · Updated last week
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆64 · Updated 6 months ago
- ☆16 · Updated 3 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo… ☆72 · Updated 2 weeks ago
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024) ☆13 · Updated last year
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab… ☆89 · Updated 4 months ago
- ☆35 · Updated last year
- [arXiv 2025] Efficient Reasoning Models: A Survey ☆184 · Updated this week
- One-shot Entropy Minimization ☆149 · Updated 2 weeks ago
- ☆66 · Updated 4 months ago
- ☆47 · Updated 7 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆59 · Updated 3 months ago
- ☆109 · Updated 3 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆59 · Updated last year
- A curated list of Model Merging methods. ☆92 · Updated 9 months ago