for-ai / iterative-data-selection
☆26Updated 4 months ago
Alternatives and similar repositories for iterative-data-selection:
Users that are interested in iterative-data-selection are comparing it to the libraries listed below
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆24Updated 3 months ago
- Exploration of automated dataset selection approaches at large scales.☆33Updated 3 weeks ago
- ☆12Updated last year
- Codebase for Instruction Following without Instruction Tuning☆33Updated 6 months ago
- ☆16Updated last month
- Long Context Extension and Generalization in LLMs☆50Updated 6 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆46Updated last month
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆44Updated last month
- Benchmarking Benchmark Leakage in Large Language Models☆52Updated 10 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆41Updated 7 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆16Updated last month
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆36Updated last year
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆60Updated 4 months ago
- ☆22Updated 3 months ago
- ☆15Updated 9 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆45Updated 2 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆47Updated 9 months ago
- ☆40Updated 3 weeks ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆51Updated last month
- Evaluate the Quality of Critique☆35Updated 9 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆21Updated 3 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆80Updated 6 months ago
- Official Code Repository for [AutoScale–Automatic Prediction of Compute-optimal Data Compositions for Training LLMs]☆12Updated last month
- This the implementation of LeCo☆32Updated 2 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆32Updated 7 months ago
- ☆20Updated 4 months ago
- ☆64Updated 11 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆11Updated 3 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆35Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year