The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
★138 · Sep 4, 2025 · Updated 6 months ago
Alternatives and similar repositories for R-4B
Users who are interested in R-4B compare it to the repositories listed below.
- LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex… ★41 · Oct 20, 2025 · Updated 4 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models" ★20 · Feb 25, 2026 · Updated last week
- AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead … ★50 · Oct 14, 2025 · Updated 4 months ago
- ★14 · Oct 12, 2024 · Updated last year
- A network that generates names, based on a conditional RNN: set a condition and generate different names. ★12 · May 15, 2017 · Updated 8 years ago
- The SAIL-VL2 series model developed by the BytedanceDouyinContent Group ★76 · Sep 18, 2025 · Updated 5 months ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings ★39 · Sep 13, 2025 · Updated 5 months ago
- Build an MCP agent using Crewai ★39 · Jun 8, 2025 · Updated 8 months ago
- ★19 · Nov 8, 2024 · Updated last year
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ★183 · Jun 5, 2025 · Updated 9 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search" ★405 · Jan 29, 2026 · Updated last month
- ★35 · Mar 8, 2025 · Updated 11 months ago
- Official repository of "TensorFlow Serving with Docker for Model Deployment" Coursera Project ★23 · Aug 27, 2020 · Updated 5 years ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ★76 · Sep 19, 2025 · Updated 5 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning" ★117 · Feb 4, 2026 · Updated last month
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception ★32 · Nov 28, 2023 · Updated 2 years ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ★402 · Aug 26, 2025 · Updated 6 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ★40 · Aug 7, 2025 · Updated 7 months ago
- A walk through HuggingFace smolagents ★49 · Mar 7, 2025 · Updated last year
- [CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens ★246 · Aug 2, 2025 · Updated 7 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give… ★214 · Oct 12, 2025 · Updated 4 months ago
- MiMo-VL ★628 · Aug 21, 2025 · Updated 6 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983… ★78 · Feb 13, 2026 · Updated 3 weeks ago
- ★63 · Sep 6, 2025 · Updated 6 months ago
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models ★235 · Nov 7, 2025 · Updated 4 months ago
- Intelligent Document Processing with AWS AI/ML, published by Packt ★12 · Updated this week
- The purpose of this repository is for devs and non devs to carry out tests on the precompiled botanix artifacts. It contains an easy rpc … ★13 · Feb 23, 2026 · Updated last week
- pix2pix and Cycle GAN architectures for image style transfer ★13 · May 27, 2021 · Updated 4 years ago
- ★30 · Jun 6, 2023 · Updated 2 years ago
- ζΊζ §εεΊ ★10 · Aug 3, 2017 · Updated 8 years ago
- Alternative version of st.camera_input which returns the webcam images live, without any button press needed ★38 · Aug 4, 2025 · Updated 7 months ago
- LMTuner: Make the LLM Better for Everyone ★38 · Sep 21, 2023 · Updated 2 years ago
- Code Repository for ControlVLA, CoRL2025. ★85 · Oct 26, 2025 · Updated 4 months ago
- Source code used in the blog ★12 · Feb 6, 2024 · Updated 2 years ago
- ★10 · Jul 13, 2024 · Updated last year
- Keras Functional API for multiple inputs and mixed data ★11 · Feb 18, 2019 · Updated 7 years ago
- Automate your blogging with AI-powered tools for creating, optimizing, and deploying content. Generate SEO-optimized articles effortlessl… ★12 · Aug 16, 2024 · Updated last year
- TensorFlow materials ★13 · Jan 8, 2021 · Updated 5 years ago
- ★17 · Aug 5, 2025 · Updated 7 months ago