for-ai / iterative-data-selection

☆26

Alternatives and similar repositories for iterative-data-selection:

Users who are interested in iterative-data-selection are comparing it to the repositories listed below.

JayZhang42 / SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆24 Updated 3 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆33 Updated 3 weeks ago
googleinterns / localizing-paragraph-memorization
☆12 Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆33 Updated 6 months ago
uservan / ThinkPO
☆16 Updated last month
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆50 Updated 6 months ago
kamanphoebe / Look-into-MoEs
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
☆46 Updated last month
PKU-ML / LongPPL
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆44 Updated last month
GAIR-NLP / benbench
Benchmarking Benchmark Leakage in Large Language Models
☆52 Updated 10 months ago
TianduoWang / DPO-ST
[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
☆41 Updated 7 months ago
yihedeng9 / DuoGuard
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
☆16 Updated last month
Pranjal2041 / AdaptiveConsistency
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
☆36 Updated last year
yyDing1 / ScaleQuest
We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
☆60 Updated 4 months ago
HypherX / Evolution-Analysis
☆22 Updated 3 months ago
gl-ybnbxb / BoNBoN
☆15 Updated 9 months ago
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆45 Updated 2 months ago
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆47 Updated 9 months ago
hkust-nlp / PreSelect
☆40 Updated 3 weeks ago
WindyLee0822 / Process_Q_Model
Official implementation of the paper "Process Reward Model with Q-value Rankings"
☆51 Updated last month
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆35 Updated 9 months ago
aryopg / decore
Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"
☆21 Updated 3 months ago
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆80 Updated 6 months ago
feiyang-k / AutoScale
Official Code Repository for [AutoScale–Automatic Prediction of Compute-optimal Data Compositions for Training LLMs]
☆12 Updated last month
starrYYxuan / LeCo
This is the implementation of LeCo
☆32 Updated 2 months ago
abertsch72 / long-context-icl
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆32 Updated 7 months ago
yifeiwang77 / Self-Correction
☆20 Updated 4 months ago
locuslab / scaling_laws_data_filtering
☆64 Updated 11 months ago
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆11 Updated 3 months ago
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆35 Updated 4 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42 Updated last year