gszfwsb / Data-WhispererLinks
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".
☆44Updated 5 months ago
Alternatives and similar repositories for Data-Whisperer
Users that are interested in Data-Whisperer are comparing it to the libraries listed below
Sorting:
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆164Updated 6 months ago
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆90Updated last month
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆67Updated 10 months ago
- ☆111Updated 6 months ago
- A Survey of Direct Preference Optimization (DPO)☆87Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆88Updated 10 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆83Updated 3 weeks ago
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆94Updated 9 months ago
- One-shot Entropy Minimization☆187Updated 6 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆153Updated 6 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆40Updated 8 months ago
- ☆57Updated 5 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆83Updated last week
- ☆125Updated last year
- ☆152Updated last year
- Survey on Data-centric Large Language Models☆88Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆56Updated 7 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆171Updated 2 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆85Updated 6 months ago
- [AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?☆26Updated 3 weeks ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆163Updated 3 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109Updated 7 months ago
- Collection of latest papers and materials in the area of RLVR!☆45Updated last week
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆234Updated 2 months ago
- ☆176Updated last month
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆28Updated 6 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆89Updated 11 months ago
- Agentic MLLMs☆128Updated 2 months ago
- ☆173Updated last year
- Extrapolating RLVR to General Domains without Verifiers☆187Updated 4 months ago