MetaStone-AI / XBai-o4Links
[ICLR2026] Test-Time Scaling with Reflective Generative Model
☆302Updated 2 weeks ago
Alternatives and similar repositories for XBai-o4
Users that are interested in XBai-o4 are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆82Updated 5 months ago
- Deep research agents using MiniMax M2.1 interleaved thinking☆196Updated last month
- The State Of The Art, intelligence☆157Updated 5 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆460Updated 5 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆278Updated 2 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆274Updated 3 months ago
- 🧠 Advanced Claude streaming interface with interleaved thinking, dynamic tool discovery, and MCP integration. Watch Claude think through…☆185Updated 7 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- ☆159Updated 9 months ago
- An open-source application for building, observing, and collaborating with teams of AI agents.☆419Updated 5 months ago
- The Open Deep Research app – generate reports with OSS LLMs☆316Updated 2 weeks ago
- Context Engineering Course with DSPy☆214Updated 6 months ago
- An OpenSource Deep Research library with reasoning☆171Updated 2 months ago
- Living memory for AI☆372Updated last month
- ☆265Updated 3 months ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆227Updated 3 months ago
- ☆181Updated 11 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- Codebase for FinePDFs☆176Updated last month
- Collection of impressive LLM apps with a focus on the financial sector☆150Updated 3 months ago
- ☆197Updated 6 months ago
- ☆274Updated 3 weeks ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆569Updated 2 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆228Updated 3 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆373Updated 5 months ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using…☆135Updated 3 weeks ago
- ☆137Updated 8 months ago
- ☆198Updated 6 months ago
- ☆107Updated 3 months ago
- ☆127Updated 4 months ago