Alibaba-NLP / LaRALinks
The code for LaRA Benchmark
☆34Updated last month
Alternatives and similar repositories for LaRA
Users that are interested in LaRA are comparing it to the libraries listed below
Sorting:
- ☆55Updated last week
- ☆94Updated 6 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 8 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆105Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆58Updated last month
- ☆152Updated last month
- ☆82Updated last year
- ☆86Updated last month
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 3 weeks ago
- Highly Efficient Query Rewriter for Passage Retrieval in the realm of Retrieval-Augmented Generation (RAG)☆25Updated last month
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆53Updated last month
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆36Updated last year
- ☆103Updated 6 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆133Updated last year
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆47Updated 2 weeks ago
- ☆47Updated 2 weeks ago
- ☆49Updated 4 months ago
- ☆85Updated 2 weeks ago
- a toolkit on knowledge distillation for large language models☆95Updated this week
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆30Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆191Updated this week
- ☆95Updated 6 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- ☆36Updated 9 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆57Updated last year
- ☆48Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆152Updated 3 weeks ago
- ☆40Updated last year