WENGSYX / LMTuner
LMTuner: Make the LLM Better for Everyone
☆33 · Updated last year
Related projects
Alternatives and complementary repositories for LMTuner
- FuseAI Project ☆76 · Updated 3 months ago
- An Experiment on Dynamic NTK Scaling RoPE ☆61 · Updated 11 months ago
- Reformatted Alignment ☆112 · Updated last month
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆56 · Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆74 · Updated 10 months ago
- Code implementation of synthetic continued pretraining ☆60 · Updated last month
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆73 · Updated 8 months ago
- Codebase for Instruction Following without Instruction Tuning ☆31 · Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆36 · Updated 8 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆51 · Updated 3 weeks ago
- Cascade Speculative Drafting ☆26 · Updated 7 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆52 · Updated last week
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆42 · Updated last week
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023. ☆62 · Updated 4 months ago
- This repository contains the joint use of the CPO and SimPO methods for better reference-free preference learning. ☆35 · Updated 3 months ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023) ☆62 · Updated 11 months ago
- CodeUltraFeedback: aligning large language models to coding preferences ☆65 · Updated 4 months ago
- evol augment any dataset online ☆55 · Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆28 · Updated 7 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆57 · Updated 7 months ago
- Towards Systematic Measurement for Long Text Quality ☆28 · Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆49 · Updated 8 months ago
- A collection of instruction data and scripts for machine translation. ☆20 · Updated last year
- A repository for research on medium-sized language models. ☆74 · Updated 5 months ago