zjunlp / ModelKinship
Exploring Model Kinship for Merging Large Language Models
☆24Updated 3 weeks ago
Alternatives and similar repositories for ModelKinship:
Users that are interested in ModelKinship are comparing it to the libraries listed below
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆58Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains☆77Updated this week
- Knowledge Unlearning for Large Language Models☆25Updated last week
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆45Updated 3 weeks ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆112Updated last year
- ☆17Updated 4 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆49Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 11 months ago
- ☆78Updated 3 months ago
- ☆24Updated 3 weeks ago
- ☆97Updated 10 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆35Updated 2 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆69Updated last month
- [Preprint] An inference-time decoding strategy with adaptive foresight sampling☆89Updated 3 weeks ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆11Updated last year
- ☆114Updated 2 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆59Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆84Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- ☆72Updated 6 months ago
- The first dense retrieval model that can be prompted like an LM☆71Updated this week
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆45Updated 3 months ago
- ☆59Updated 8 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆56Updated last month
- ☆46Updated 2 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 2 months ago
- This is the official repository for Inheritune.☆111Updated 3 months ago