[ICLR2026] Test-Time Scaling with Reflective Generative Model
☆302Jan 28, 2026Updated 4 months ago
Alternatives and similar repositories for XBai-o4
Users that are interested in XBai-o4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The open-source code of MetaStone-S1.☆106Aug 1, 2025Updated 10 months ago
- Marketplace ML experiment - training without backprop☆28Sep 9, 2025Updated 9 months ago
- ☆19Mar 3, 2025Updated last year
- ☆39Aug 4, 2025Updated 10 months ago
- gpt-oss + voice-ui-kit experiment☆150Aug 6, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- MCP to provide secure IT tools for AI network troubleshooting (remote ssh, ping, nslookup, etc)☆17Apr 20, 2025Updated last year
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Jul 3, 2025Updated 11 months ago
- Portal: GUI Tools for Agents☆25Sep 18, 2025Updated 9 months ago
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆26Oct 1, 2025Updated 8 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year
- The State Of The Art, intelligence☆161Aug 12, 2025Updated 10 months ago
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Structural mirror consistency, observer perturbation, mirror break, and rupture auditing across supplied reflections, transformations, an…☆75Updated this week
- Your own Coding Agent 🤖☆105May 22, 2025Updated last year
- Gemma 2 optimized for your local machine.☆384Aug 7, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆75May 13, 2026Updated last month
- A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI …☆1,113Jul 16, 2025Updated 11 months ago
- ☆18May 17, 2025Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for the paper Don't Pay Attention☆59Sep 25, 2025Updated 8 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Aug 18, 2024Updated last year
- This is a simple guide to help you build an Anthropic Claude Sonnet 3.5 chatbot interface with Gradio☆12Jun 23, 2024Updated last year
- ☆22Jan 29, 2026Updated 4 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆361Updated this week
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆35Aug 13, 2025Updated 10 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆324Oct 13, 2025Updated 8 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,645Apr 8, 2025Updated last year
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆965Jun 8, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- key/value store for Python based on Cloudflare workers☆33Jun 13, 2025Updated last year
- GRadient-INformed MoE☆264Sep 25, 2024Updated last year
- Amazon Bedrock AgentCore – Multi Framework Examples☆49Sep 24, 2025Updated 8 months ago
- NeurIPS 2026 paper: The Geometry of Consolidation — follow-up to HIDE and No-Escape.☆110May 5, 2026Updated last month
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆105Apr 7, 2026Updated 2 months ago
- Create and manage capitalization tables.☆26Jun 8, 2022Updated 4 years ago
- Lightweight Vision native Multimodal Document Agent☆160Sep 3, 2025Updated 9 months ago