Alibaba-NLP / LaRA
The code for LaRA Benchmark
☆35 Updated last month
Alternatives and similar repositories for LaRA
Users who are interested in LaRA are comparing it to the libraries listed below.
- ☆94 Updated 7 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆33 Updated last year
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation ☆110 Updated 2 months ago
- [ACL 2025] Official PyTorch implementation of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement ☆31 Updated last month
- ☆61 Updated this week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆135 Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models ☆40 Updated last year
- ☆90 Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆88 Updated this week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆196 Updated 2 weeks ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆62 Updated 9 months ago
- ☆102 Updated 7 months ago
- ☆155 Updated 2 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆60 Updated 2 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆57 Updated 9 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆255 Updated 2 weeks ago
- A Toolkit for Table-based Question Answering ☆112 Updated last year
- Repo of ACL 2025 main Paper "Quantification of Large Language Model Distillation" ☆88 Updated 2 months ago
- ☆83 Updated last year
- ☆81 Updated 2 months ago
- ☆33 Updated last month
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation ☆56 Updated 2 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆40 Updated last year
- Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning" ☆54 Updated 3 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models. ☆244 Updated 8 months ago
- The RedStone repository includes code for preparing extensive datasets used in training large language models. ☆136 Updated 3 weeks ago
- ☆47 Updated last month
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene… ☆36 Updated 2 months ago
- ☆56 Updated 8 months ago
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning ☆81 Updated last year