☆20Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for Small-Model-Learnability-Gap
Users that are interested in Small-Model-Learnability-Gap are comparing it to the libraries listed below
Sorting:
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts☆25Feb 23, 2024Updated 2 years ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆80Sep 19, 2025Updated 5 months ago
- [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?☆37Jun 5, 2025Updated 8 months ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆33Aug 23, 2025Updated 6 months ago
- ☆46Mar 4, 2025Updated 11 months ago
- ☆10Feb 22, 2022Updated 4 years ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated last month
- personal settings for linux tools, including zsh, vim, tmux, pip.☆11Dec 2, 2019Updated 6 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- Documentation at☆14Mar 27, 2025Updated 11 months ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆44Feb 18, 2026Updated last week
- ☆19Jul 8, 2025Updated 7 months ago
- ☆13Jun 25, 2025Updated 8 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆22Sep 25, 2025Updated 5 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆14Mar 24, 2025Updated 11 months ago
- ☆14Oct 17, 2024Updated last year
- 🔧 Custom utils. 供日常使用的脚本小工具。☆10Jun 14, 2024Updated last year
- Code to reproduce the paper "Do causal predictors generalize better to new domains?"☆15Feb 7, 2025Updated last year
- DINO-based perceptual losses and FDD feature extraction☆25Jan 7, 2026Updated last month
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆12Oct 28, 2024Updated last year
- ☆43May 6, 2024Updated last year
- An official implementation for the KDD 2025 paper 'Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Eff…☆22Jun 4, 2025Updated 8 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 7 months ago
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆23Oct 3, 2025Updated 4 months ago
- A Chinese Character BERT Trained with Multi-Level Masking☆11Sep 24, 2023Updated 2 years ago
- ☆16Jun 10, 2025Updated 8 months ago
- This is the official repository of the paper Exploring Superior Function Calls via Reinforcement Learning.☆34Aug 11, 2025Updated 6 months ago
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- Official repository for ALT (ALignment with Textual feedback).☆10Jul 25, 2024Updated last year
- ☆12Feb 16, 2024Updated 2 years ago
- ☆54May 22, 2025Updated 9 months ago
- ☆46Dec 30, 2024Updated last year
- 西北工业大学U14M11107计算机视觉课程作业☆13Nov 9, 2022Updated 3 years ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- CLI util: Poor man's rpath for Windows executables.☆12Dec 16, 2018Updated 7 years ago