SmallDoges / small-datasets
Distill thinking dataset more compactly and accurately!
☆36 Updated 7 months ago
Alternatives and similar repositories for small-datasets
Users interested in small-datasets are comparing it to the repositories listed below.
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆102 Updated 4 months ago
- ☆131 Updated 8 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" ☆68 Updated 9 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆107 Updated 8 months ago
- Efficient Agent Training for Computer Use ☆135 Updated 4 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond ☆189 Updated 6 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆125 Updated 5 months ago
- ☆97 Updated 2 weeks ago
- ☆85 Updated 2 months ago
- Complex Function Calling Benchmark. ☆162 Updated last year
- ☆95 Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆175 Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆116 Updated last year
- Verifiers for LLM Reinforcement Learning ☆80 Updated 9 months ago
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆51 Updated last year
- ☆26 Updated last year
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI. ☆89 Updated 2 months ago
- FuseAI Project ☆88 Updated 11 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆211 Updated last month
- accompanying material for sleep-time compute paper ☆118 Updated 8 months ago
- ☆88 Updated 8 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆95 Updated 8 months ago
- Official code repository for Sketch-of-Thought (SoT) ☆132 Updated 8 months ago
- ☆83 Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆88 Updated 10 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆96 Updated 8 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆35 Updated 10 months ago
- ☆55 Updated last year
- This is the official repository for Inheritune. ☆120 Updated 11 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆46 Updated 5 months ago