princeton-nlp / LESSLinks

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

☆474

Alternatives and similar repositories for LESS

Users that are interested in LESS are comparing it to the libraries listed below

Sorting:

hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆561Updated 7 months ago
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆504Updated 6 months ago
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆268Updated 10 months ago
tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆381Updated last month
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆238Updated last year
sangmichaelxie / doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
☆340Updated last year
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical report on Slow Thinking with LLM
☆713Updated last month
dvlab-research / Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆375Updated 6 months ago
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165Updated last year
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆619Updated last month
ZigeW / data_management_LLM
Collection of training data management explorations for large language models
☆329Updated last year
MARIO-Math-Reasoning / Super_MARIO
☆337Updated last month
alon-albalak / data-selection-survey
A Survey on Data Selection for Language Models
☆245Updated 3 months ago
teacherpeterpan / self-correction-llm-papers
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
☆542Updated 9 months ago
OFA-Sys / InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
☆265Updated last year
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆238Updated last month
lqtrung1998 / mwp_ReFT
☆544Updated 7 months ago
LuckyyySTA / Awesome-LLM-hallucination
LLM hallucination paper list
☆320Updated last year
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆654Updated 6 months ago
QwenLM / AutoIF
☆298Updated last year
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆345Updated last year
eddycmu / demystify-long-cot
☆306Updated 2 months ago
wjn1996 / Awesome-LLM-Reasoning-Openai-o1-Survey
The related works and background techniques about Openai o1
☆224Updated 6 months ago
GAIR-NLP / auto-j
Generative Judge for Evaluating Alignment
☆244Updated last year
YuxiXie / MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆319Updated 11 months ago
Ablustrund / LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆360Updated last year
princeton-nlp / QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
☆183Updated last year
getao / icae
The repo for In-context Autoencoder
☆130Updated last year
RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆497Updated last year
THUNLP-MT / StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
☆165Updated 3 months ago