SkyworkAI / skywork-o1-prm-inferenceView external linksLinks
☆68Nov 26, 2024Updated last year
Alternatives and similar repositories for skywork-o1-prm-inference
Users that are interested in skywork-o1-prm-inference are comparing it to the libraries listed below
Sorting:
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Updated this week
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆12Apr 17, 2025Updated 9 months ago
- Useful Collection of Claude Code Configurations☆24Oct 20, 2025Updated 3 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆21Oct 16, 2025Updated 3 months ago
- ☆22Oct 23, 2025Updated 3 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆65Feb 5, 2025Updated last year
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆30Nov 8, 2025Updated 3 months ago
- An automated data pipeline scaling RL to pretraining levels☆72Oct 11, 2025Updated 4 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,833Jan 17, 2025Updated last year
- UnifiedToolHub is a comprehensive project supporting LLM-based tool use, designed to unify various tool-use dataset formats and provide t…☆19Jul 23, 2025Updated 6 months ago
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆27Dec 25, 2025Updated last month
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- ☆1,346Nov 21, 2024Updated last year
- ☆11Feb 6, 2026Updated last week
- private-machine is an AI companion system with emotion, needs and goals simulation. Very silly, not based on real science.☆28Nov 13, 2025Updated 3 months ago
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 9 months ago
- ☆18Dec 12, 2025Updated 2 months ago
- ☆28Feb 11, 2025Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆741Jun 6, 2025Updated 8 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)☆519Sep 27, 2025Updated 4 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆690Jan 20, 2025Updated last year
- O1 Replication Journey☆2,000Jan 14, 2025Updated last year
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated 11 months ago
- ☆24Jan 22, 2025Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated 11 months ago
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆26Jul 26, 2025Updated 6 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆59Feb 6, 2026Updated last week
- ☆57Feb 10, 2025Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- practical claude code commands and subagents☆65Jan 23, 2026Updated 3 weeks ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109May 27, 2025Updated 8 months ago
- A series of math-specific large language models of our Qwen2 series.☆1,065Jan 11, 2025Updated last year
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆283Feb 19, 2025Updated 11 months ago
- Scalable RL solution for advanced reasoning of language models☆1,803Mar 18, 2025Updated 10 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆193Mar 20, 2025Updated 10 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Jun 27, 2025Updated 7 months ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 8 months ago