WENGSYX / LMTuner
LMTuner: Make the LLM Better for Everyone
☆34Updated last year
Alternatives and similar repositories for LMTuner:
Users that are interested in LMTuner are comparing it to the libraries listed below
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- FuseAI Project☆85Updated 2 months ago
- Reformatted Alignment☆115Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆44Updated 10 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 3 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.☆29Updated 5 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆36Updated 3 weeks ago
- ☆47Updated 4 months ago
- Exploration of automated dataset selection approaches at large scales.☆38Updated last month
- Codebase for Instruction Following without Instruction Tuning☆34Updated 6 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 9 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 4 months ago
- Cascade Speculative Drafting☆29Updated last year
- ☆49Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated 9 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 5 months ago
- ☆64Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Updated 4 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated last year
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆75Updated 10 months ago
- ☆22Updated 4 months ago
- ☆36Updated 7 months ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 10 months ago
- ☆24Updated 7 months ago