☆1,263Feb 12, 2026Updated 3 weeks ago
Alternatives and similar repositories for m3-agent
Users that are interested in m3-agent are comparing it to the libraries listed below
Sorting:
- ☆28Jan 5, 2026Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆910Jul 31, 2025Updated 7 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆405Jan 29, 2026Updated last month
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆141Aug 21, 2025Updated 6 months ago
- Tongyi Deep Research, the Leading Open-source Deep Research Agent☆18,337Feb 27, 2026Updated last week
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs☆2,296Oct 5, 2025Updated 5 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆19,519Updated this week
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆12May 5, 2025Updated 10 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 4 months ago
- GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning☆2,201Jan 27, 2026Updated last month
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆88Jun 10, 2025Updated 8 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,586Updated this week
- Open-source unified multimodal model☆5,704Oct 27, 2025Updated 4 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆309Oct 13, 2025Updated 4 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,249Aug 16, 2025Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,085Nov 13, 2025Updated 3 months ago
- Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…☆1,551Jun 14, 2025Updated 8 months ago
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆648Feb 27, 2026Updated last week
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆358Jan 12, 2026Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆608Feb 15, 2026Updated 2 weeks ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆77Jul 30, 2025Updated 7 months ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)☆1,591Feb 14, 2026Updated 3 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,522Feb 27, 2026Updated last week
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆78Feb 13, 2026Updated 3 weeks ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,649Feb 26, 2026Updated last week
- The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents"☆802Dec 28, 2025Updated 2 months ago
- A Collection of Papers about Memory for Language Agents☆358Jan 21, 2026Updated last month
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆3,936Jun 12, 2025Updated 8 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆425Jun 20, 2025Updated 8 months ago
- User Profile-Based Long-Term Memory for AI Chatbot Applications.☆2,578Jan 11, 2026Updated last month
- Mobile-Agent: The Powerful GUI Agent Family☆7,971Updated this week
- ✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction☆2,494Mar 28, 2025Updated 11 months ago
- Structured Video Comprehension of Real-World Shorts☆231Sep 21, 2025Updated 5 months ago
- ☆139Nov 17, 2025Updated 3 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆9,854Sep 22, 2025Updated 5 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,887Jan 8, 2026Updated last month
- AI memory OS for LLM and Agent systems(moltbot,clawdbot,openclaw), enabling persistent Skill memory for cross-task skill reuse and evolut…☆6,114Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,451Feb 16, 2026Updated 2 weeks ago