yannqi / R-4B
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
★136 · Sep 4, 2025 · Updated 5 months ago
Alternatives and similar repositories for R-4B
Users who are interested in R-4B are comparing it to the repositories listed below.
- LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex… ★41 · Oct 20, 2025 · Updated 3 months ago
- ★14 · Oct 12, 2024 · Updated last year
- A NetWork Generate Names, Based On Conditional RNN, Set Condition And Generate Different Names. ★12 · May 15, 2017 · Updated 8 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ★20 · Dec 2, 2024 · Updated last year
- This is a framework for evaluating reasoning in foundational Video Models. ★49 · Updated this week
- The SAIL-VL2 series model developed by the BytedanceDouyinContent Group ★76 · Sep 18, 2025 · Updated 4 months ago
- Official implementation of "VIRAL: Visual Representation Alignment for MLLMs". ★147 · Sep 21, 2025 · Updated 4 months ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings ★39 · Sep 13, 2025 · Updated 5 months ago
- Build an MCP agent using Crewai ★39 · Jun 8, 2025 · Updated 8 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding ★112 · Jan 26, 2026 · Updated 2 weeks ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ★88 · Jun 10, 2025 · Updated 8 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ★182 · Jun 5, 2025 · Updated 8 months ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs ★23 · Oct 15, 2024 · Updated last year
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search" ★401 · Jan 29, 2026 · Updated 2 weeks ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ★75 · Sep 19, 2025 · Updated 4 months ago
- [CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness ★443 · May 14, 2025 · Updated 9 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning" ★113 · Feb 4, 2026 · Updated last week
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ★392 · Aug 26, 2025 · Updated 5 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983… ★69 · Jan 28, 2026 · Updated 2 weeks ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI. ★238 · May 14, 2025 · Updated 9 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give… ★209 · Oct 12, 2025 · Updated 4 months ago
- MiMo-VL ★623 · Aug 21, 2025 · Updated 5 months ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization ★61 · Sep 19, 2025 · Updated 4 months ago
- vLLM performance dashboard ★41 · Apr 26, 2024 · Updated last year
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models ★233 · Nov 7, 2025 · Updated 3 months ago
- Intelligent Document Processing with AWS AI/ML, published by Packt ★10 · Feb 5, 2026 · Updated last week
- The code repository of UniRL ★51 · May 30, 2025 · Updated 8 months ago
- LMTuner: Make the LLM Better for Everyone ★38 · Sep 21, 2023 · Updated 2 years ago
- Code Repository for ControlVLA, CoRL2025. ★84 · Oct 26, 2025 · Updated 3 months ago
- Docker For Python Machine learning ★11 · Jan 11, 2023 · Updated 3 years ago
- Code for my collection of predictors/classifiers/etc ★14 · Jul 18, 2024 · Updated last year
- This is a collection of recent papers on reasoning in video generation models. ★95 · Jan 8, 2026 · Updated last month
- Keras Functional API for multiple inputs and mixed data ★11 · Feb 18, 2019 · Updated 6 years ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection ★16 · Mar 23, 2025 · Updated 10 months ago
- ★10 · Jul 2, 2021 · Updated 4 years ago
- ★22 · Dec 11, 2025 · Updated 2 months ago
- Python-based Legal Advisor harnesses the power of advanced Language Models for comprehensive legal guidance. Uses LLM internally to gi… ★13 · Mar 3, 2024 · Updated last year
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24. ★12 · Nov 27, 2024 · Updated last year
- Watsonx Assistant with Milvus as Vector Database ★12 · Mar 31, 2025 · Updated 10 months ago