tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆354Updated 8 months ago
Alternatives and similar repositories for Reflection_Tuning:
Users that are interested in Reflection_Tuning are comparing it to the libraries listed below
- Official repository for ORPO☆450Updated 11 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆693Updated last month
- RewardBench: the first evaluation tool for reward models.☆562Updated this week
- ☆287Updated last month
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆459Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆482Updated 3 months ago
- ☆515Updated 5 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆409Updated last year
- ☆671Updated last week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆195Updated last month
- ☆279Updated 9 months ago
- Automatic evals for LLMs☆376Updated this week
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆395Updated 11 months ago
- Generative Representational Instruction Tuning☆626Updated last month
- Reproducible, flexible LLM evaluations☆198Updated last month
- The official evaluation suite and dynamic data release for MixEval.☆238Updated 5 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆303Updated 7 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆221Updated 6 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆215Updated 6 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆214Updated last month
- FuseAI Project☆563Updated 3 months ago
- Code for Quiet-STaR☆731Updated 8 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆205Updated 11 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆311Updated 11 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆244Updated 2 weeks ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆190Updated 5 months ago
- ☆924Updated 3 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆189Updated 9 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆240Updated last year
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆205Updated 2 years ago