[ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".
☆22 · Feb 26, 2025 · Updated last year
Alternatives and similar repositories for Distilling-CoT-Reasoning
Users who are interested in Distilling-CoT-Reasoning are comparing it to the repositories listed below.
- [EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mech… ☆16 · Oct 8, 2024 · Updated last year
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua… ☆13 · Nov 11, 2024 · Updated last year
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda… ☆17 · Dec 13, 2024 · Updated last year
- Repository of Streaming LLMs ☆53 · Apr 16, 2026 · Updated 2 weeks ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". … ☆22 · Nov 17, 2025 · Updated 5 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆322 · Apr 2, 2026 · Updated last month
- ☆14 · Nov 19, 2024 · Updated last year
- ☆115 · Oct 21, 2025 · Updated 6 months ago
- Matlab implementation of our TMM 2020 paper "Pixel-level Non-local Image Smoothing with Objective Evaluation" ☆10 · Nov 24, 2020 · Updated 5 years ago
- ☆17 · Jun 10, 2025 · Updated 10 months ago
- Recurrent Conditional Query Learning ☆11 · Oct 26, 2025 · Updated 6 months ago
- ☆12 · Jun 13, 2025 · Updated 10 months ago
- ☆16 · Sep 4, 2025 · Updated 8 months ago
- ☆18 · Mar 2, 2026 · Updated 2 months ago
- ☆24 · Jun 13, 2023 · Updated 2 years ago
- ☆22 · Oct 10, 2025 · Updated 6 months ago
- James' cookbook of evaluations and finetuning experiments ☆26 · Feb 19, 2026 · Updated 2 months ago
- Awesome Long-CoT Data ☆20 · Mar 26, 2025 · Updated last year
- ☆41 · Feb 11, 2025 · Updated last year
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆27 · Jul 9, 2024 · Updated last year
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward ☆68 · Aug 10, 2025 · Updated 8 months ago
- ☆19 · Mar 10, 2025 · Updated last year
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis' ☆32 · May 19, 2025 · Updated 11 months ago
- ☆29 · Aug 25, 2024 · Updated last year
- ☆35 · May 16, 2025 · Updated 11 months ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197) ☆41 · Sep 8, 2025 · Updated 7 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆54 · Jul 15, 2025 · Updated 9 months ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat… ☆34 · Aug 23, 2025 · Updated 8 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆66 · Feb 21, 2025 · Updated last year
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro… ☆11 · Jul 9, 2025 · Updated 9 months ago
- Documentation at ☆14 · Mar 27, 2025 · Updated last year
- [NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning ☆91 · Nov 4, 2025 · Updated 6 months ago
- ChemPile project ☆19 · Jul 31, 2025 · Updated 9 months ago
- Active attention in classification networks that is optimised at the time of model training. ☆11 · Nov 9, 2018 · Updated 7 years ago
- ☆50 · Mar 20, 2026 · Updated last month
- [SDM 2023] Probabilistic Decomposition Transformer for Time Series Forecasting ☆18 · Sep 19, 2023 · Updated 2 years ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆215 · Nov 30, 2025 · Updated 5 months ago
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference" ☆16 · Mar 25, 2025 · Updated last year
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆33 · Oct 20, 2025 · Updated 6 months ago