zjunlp / ModelKinshipLinks
Exploring Model Kinship for Merging Large Language Models
☆24Updated 2 months ago
Alternatives and similar repositories for ModelKinship
Users that are interested in ModelKinship are comparing it to the libraries listed below
Sorting:
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated 8 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆61Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 9 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆72Updated 3 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆37Updated last month
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆61Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆38Updated 3 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆115Updated last year
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆65Updated this week
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆46Updated 5 months ago
- ☆65Updated last year
- This is the official repository for Inheritune.☆111Updated 4 months ago
- ☆80Updated 5 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆95Updated 2 weeks ago
- A repository for research on medium sized language models.☆76Updated last year
- ☆29Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆141Updated 2 weeks ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated last month
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆49Updated 3 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆79Updated 10 months ago
- ☆123Updated 8 months ago
- ☆17Updated 5 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆144Updated 9 months ago
- This the implementation of LeCo☆31Updated 5 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 3 months ago
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51Updated last year
- ☆85Updated 7 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆56Updated 3 months ago