Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
☆345Mar 16, 2026Updated last month
Alternatives and similar repositories for the-markovian-thinker
Users that are interested in the-markovian-thinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Public site of Russian-speaking AGI community☆13Updated this week
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 7 months ago
- [EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation☆19Dec 11, 2024Updated last year
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection☆34Jul 23, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reinforcement Learning from Text Feedback☆33Feb 17, 2026Updated 2 months ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- ☆54Feb 19, 2025Updated last year
- ☆29Aug 25, 2024Updated last year
- a jax benchmark for ad hoc teamwork☆21Updated this week
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated last year
- TIFMO: Textual Inference Forward-chaining MOdule☆12Apr 25, 2014Updated 11 years ago
- This is a PyTorch implementation of a Transformer Decoder based model that plays chess.☆17Mar 15, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆376Mar 30, 2026Updated 2 weeks ago
- The Programmers Open Workbench☆13Dec 19, 2011Updated 14 years ago
- Reusable Oberon-2 modules☆12Dec 24, 2025Updated 3 months ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆17Nov 18, 2025Updated 5 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆27Mar 30, 2026Updated 2 weeks ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆42Jul 21, 2025Updated 8 months ago
- [NO LONGER MAINTAINED, SUPERSEDED BY https://github.com/trueagi-io/pln-experimental and https://github.com/trueagi-io/PLN]. Probabilisti…☆16Sep 20, 2025Updated 6 months ago
- ☆17Jun 8, 2025Updated 10 months ago
- This repository contains the source codes for the paper: "SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environm…☆16Oct 11, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆42Jul 24, 2025Updated 8 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆25Mar 6, 2026Updated last month
- A standard library for Oberon-2☆13Sep 2, 2014Updated 11 years ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- ☆15Jun 18, 2015Updated 10 years ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 2 months ago
- ☆24Sep 25, 2025Updated 6 months ago
- Optimizing Oberon-2 Compiler☆17Apr 5, 2016Updated 10 years ago
- [ACL 2026] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆60Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A curated list of awesome Machine Learning frameworks, libraries and software. With repository stars⭐ and forks🍴☆17Updated this week
- ☆14Oct 28, 2023Updated 2 years ago
- ☆10Feb 22, 2022Updated 4 years ago
- Generic heap allocation procedures Memory.New(ptr, size) for the Project Oberon 2013 operating system☆12Nov 4, 2024Updated last year
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!☆11Jun 12, 2023Updated 2 years ago
- Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation"☆33Sep 28, 2025Updated 6 months ago
- ☆19May 11, 2023Updated 2 years ago