QwenLM / Self-Lengthen
☆51 · Updated this week
Related projects
Alternatives and complementary repositories for Self-Lengthen
- ☆62 · Updated last month
- The Official Code Repository for GUI-World. ☆37 · Updated 3 months ago
- ☆49 · Updated 3 weeks ago
- The official repository of the Omni-MATH benchmark. ☆47 · Updated last week
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI" ☆85 · Updated last month
- Reformatted Alignment ☆112 · Updated last month
- FuseAI Project ☆76 · Updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆114 · Updated this week
- Codebase for Instruction Following without Instruction Tuning ☆30 · Updated last month
- Code implementation of synthetic continued pretraining ☆54 · Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆102 · Updated 6 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆129 · Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆73 · Updated 9 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ☆43 · Updated 3 months ago
- ☆57 · Updated last month
- ☆21 · Updated last month
- Expert Specialized Fine-Tuning ☆144 · Updated last month
- ☆61 · Updated 2 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆28 · Updated 7 months ago
- This is the official repository for Inheritune. ☆105 · Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆61 · Updated 3 weeks ago
- A repository for research on medium sized language models. ☆74 · Updated 5 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 · Updated 3 weeks ago
- ☆44 · Updated last month
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆45 · Updated last month
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆50 · Updated 6 months ago
- HelloBench: evaluating long text generation capabilities of LLMs ☆29 · Updated 3 weeks ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆34 · Updated 3 weeks ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆38 · Updated 3 months ago
- SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights ☆34 · Updated 3 weeks ago