[ICLR2026] Test-Time Scaling with Reflective Generative Model
☆301Jan 28, 2026Updated last month
Alternatives and similar repositories for XBai-o4
Users that are interested in XBai-o4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple & Scalable Pretraining for Neural Architecture Research☆309Dec 6, 2025Updated 3 months ago
- The open-source code of MetaStone-S1.☆106Aug 1, 2025Updated 7 months ago
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 6 months ago
- ☆19Mar 3, 2025Updated last year
- ☆39Aug 4, 2025Updated 7 months ago
- gpt-oss + voice-ui-kit experiment☆151Aug 6, 2025Updated 7 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Jul 3, 2025Updated 8 months ago
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆21Oct 1, 2025Updated 5 months ago
- Portal: GUI Tools for Agents☆25Sep 18, 2025Updated 6 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 6 months ago
- MB-X.01 · Logical Origin Node (L.O.N.) — TruthΩ → Co⁺ → Score⁺. Demo e spec verificabili. https://massimiliano.neocities.org/☆65Feb 3, 2026Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Feb 19, 2026Updated last month
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated 2 years ago
- ☆68May 26, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- A simple external application for Windows that allows you to scan an existing custom_nodes directory and generate a list of the nodes ins…☆20Jul 6, 2025Updated 8 months ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI …☆1,108Jul 16, 2025Updated 8 months ago
- Amazon Bedrock AgentCore – Multi Framework Examples☆44Sep 24, 2025Updated 6 months ago
- ☆19May 17, 2025Updated 10 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Aug 18, 2024Updated last year
- Code for the paper Don't Pay Attention☆55Sep 25, 2025Updated 5 months ago
- LisanBench is a lightweight benchmark for LLMs that stresses forward planning, vocabulary depth, constraint adherence, attention, and lon…☆31Jun 1, 2025Updated 9 months ago
- This is a simple guide to help you build an Anthropic Claude Sonnet 3.5 chatbot interface with Gradio☆12Jun 23, 2024Updated last year
- ☆40Oct 2, 2025Updated 5 months ago
- ☆17Jan 29, 2026Updated last month
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆1,023Mar 3, 2026Updated 3 weeks ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆96Mar 5, 2026Updated 2 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆312Oct 13, 2025Updated 5 months ago
- ☆16Apr 30, 2025Updated 10 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆33Aug 13, 2025Updated 7 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,644Apr 8, 2025Updated 11 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆925Feb 28, 2026Updated 3 weeks ago
- Model code for inferencing T5☆66Mar 10, 2025Updated last year
- cheap & easy LLM experiments for amateurs (alpha)☆25Nov 30, 2025Updated 3 months ago
- Language modeling with linear-cost context☆119Sep 25, 2025Updated 5 months ago
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 9 months ago