tianyi-lab / Reflection_TuningLinks

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

☆362

Alternatives and similar repositories for Reflection_Tuning

Users that are interested in Reflection_Tuning are comparing it to the libraries listed below

Sorting:

xfactlab / orpo
Official repository for ORPO
☆463Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆439Updated last year
allenai / olmes
Reproducible, flexible LLM evaluations
☆257Updated this week
huggingface / cosmopedia
☆544Updated 11 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆250Updated 11 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆242Updated 11 months ago
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆477Updated last year
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆642Updated 4 months ago
Cohere-Labs-Community / parameter-efficient-moe
☆271Updated last year
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆782Updated 7 months ago
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆520Updated 9 months ago
DataArcTech / LLM-as-a-Judge
☆146Updated last week
QwenLM / AutoIF
☆312Updated last year
google-deepmind / loft
LOFT: A 1 Million+ Token Long-Context Benchmark
☆218Updated 4 months ago
kaistAI / CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆246Updated last year
SqueezeAILab / LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
☆190Updated last year
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆314Updated last year
eddycmu / demystify-long-cot
☆323Updated 4 months ago
fanqiwan / FuseAI
FuseAI Project
☆583Updated 8 months ago
Mohammadjafari80 / GSM8K-RLVR
A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.
☆136Updated 8 months ago
neelsjain / NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
☆401Updated last year
ContextualAI / gritlm
Generative Representational Instruction Tuning
☆675Updated 3 months ago
knoveleng / open-rs
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
☆266Updated last week
p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆261Updated last year
nelson-liu / lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
☆360Updated last year
HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆440Updated last year
mlfoundations / evalchemy
Automatic evals for LLMs
☆547Updated 3 months ago
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆240Updated 11 months ago
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆263Updated 3 months ago
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆571Updated 10 months ago