SkyworkAI / skywork-o1-prm-inferenceLinks

☆65

Alternatives and similar repositories for skywork-o1-prm-inference

Users that are interested in skywork-o1-prm-inference are comparing it to the libraries listed below

Sorting:

GAIR-NLP / LIMR
☆213Updated 9 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆105Updated last year
sail-sg / oat-zero
A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
☆248Updated 7 months ago
yyht / openrlhf_async_pipline
☆86Updated 3 months ago
THUDM / LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
☆256Updated 11 months ago
hkust-nlp / dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆119Updated 11 months ago
LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling
A Comprehensive Survey on Long Context Language Modeling
☆209Updated 2 weeks ago
nick7nlp / Counting-Stars
Counting-Stars (★)
☆83Updated 2 weeks ago
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆221Updated 4 months ago
IAAR-Shanghai / xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
☆140Updated 3 weeks ago
TemporaryLoRA / Temp-LoRA
☆122Updated last year
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆263Updated 5 months ago
ADaM-BJTU / OpenRFT
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆153Updated 11 months ago
InternLM / OREAL
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆190Updated 8 months ago
eddycmu / demystify-long-cot
☆327Updated 6 months ago
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆257Updated 6 months ago
Zanette-Labs / SpeculativeRejection
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆52Updated last year
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆112Updated 10 months ago
mtbench101 / mt-bench-101
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
☆130Updated last year
GAIR-NLP / ToRL
☆319Updated 6 months ago
Wangmerlyn / MCTS-GSM8k-Demo
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆92Updated 3 weeks ago
agentica-project / verl-pipeline
Async pipelined version of Verl
☆125Updated 7 months ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆113Updated last year
princeton-nlp / CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆167Updated last year
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆180Updated 4 months ago
QwenLM / AutoIF
☆316Updated last year
ganler / code-r1
Reproducing R1 for Code with Reliable Rewards
☆275Updated 7 months ago
tongyx361 / Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…
☆145Updated last year
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆268Updated last year
RUCAIBox / SimpleDeepSearcher
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
☆111Updated 6 months ago