rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
☆39 Jan 13, 2025 Updated last year
Alternatives and similar repositories for rStar-Math
Users who are interested in rStar-Math are comparing it to the libraries listed below.
- ☆1,402 Sep 12, 2025 Updated 6 months ago
- ☆44 Feb 4, 2026 Updated last month
- [EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning ☆50 Oct 11, 2024 Updated last year
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?" ☆23 Feb 19, 2025 Updated last year
- This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning" ☆65 Dec 29, 2025 Updated 2 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆224 Jul 25, 2025 Updated 7 months ago
- ☆968 Jan 23, 2025 Updated last year
- A proofreading tool using Google's N-gram corpus. ☆12 Sep 2, 2022 Updated 3 years ago
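- PyTorch text classification refresher exercise. This project mainly targets simple classification of short texts; treat it as a demo only. The networks used are FastText, TextCNN, TextRNN, TextRCNN, and Transformer. ☆17 May 27, 2020 Updated 5 years ago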
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents. ☆18 Nov 24, 2025 Updated 4 months ago
- InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models ☆96 Feb 2, 2026 Updated last month
- IAN: An Intelligent System for Omics Data Analysis and Discovery ☆10 Feb 23, 2026 Updated last month
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023. ☆64 Nov 27, 2024 Updated last year
- ☆72 Apr 2, 2024 Updated last year
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision" ☆27 May 6, 2021 Updated 4 years ago
- 🎉 TrustJudge is accepted to ICLR 2026! ☆38 Sep 27, 2025 Updated 5 months ago
- ☆20 Jan 7, 2024 Updated 2 years ago
- AN O1 REPLICATION FOR CODING ☆333 Dec 11, 2024 Updated last year
- Official code repo of SimMLM [ICCV 2025] ☆22 Dec 1, 2025 Updated 3 months ago
- Ongoing research project for code&math LLMs ☆27 Jul 4, 2025 Updated 8 months ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models" ☆14 Mar 25, 2025 Updated last year
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment. ☆25 Nov 25, 2025 Updated 3 months ago
- Compare how fine-tuned AI video models interpret the same prompts ☆14 Jan 29, 2025 Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi… ☆15 Mar 16, 2026 Updated last week
- ☆337 May 24, 2025 Updated 10 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization ☆24 Mar 6, 2026 Updated 2 weeks ago
- ☆17 Dec 23, 2025 Updated 3 months ago
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform. ☆13 Jul 27, 2025 Updated 7 months ago
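- Re-identification of campus bicycles in surveillance-quality footage, including ReID model recognition, vector database retrieval, and a UI display. ☆11 Feb 13, 2024 Updated 2 years ago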
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models ☆11 Sep 19, 2025 Updated 6 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning ☆115 Feb 2, 2026 Updated last month
- ☆14 Oct 11, 2023 Updated 2 years ago
- ☆11 Dec 15, 2025 Updated 3 months ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 Jul 16, 2025 Updated 8 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens ☆19 Feb 29, 2024 Updated 2 years ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆71 Sep 13, 2025 Updated 6 months ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs" ☆12 Apr 20, 2024 Updated last year
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy" ☆16 Feb 25, 2025 Updated last year
- [Neurocomputing] Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation ☆23 Dec 21, 2025 Updated 3 months ago