rickyang1114 / multimodal-deepresearcherLinks
Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework
☆34Updated 4 months ago
Alternatives and similar repositories for multimodal-deepresearcher
Users that are interested in multimodal-deepresearcher are comparing it to the libraries listed below
Sorting:
- ☆16Updated last year
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 2 months ago
- ☆32Updated 5 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆51Updated last month
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆33Updated last month
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆32Updated 3 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Updated 2 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Updated 5 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆47Updated 9 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Updated last year
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆39Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 10 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆34Updated 2 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆35Updated 6 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Updated 10 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆20Updated last year
- ☆16Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Updated 5 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆45Updated 3 months ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Updated 4 months ago
- Code implementation for DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric Reasoning.☆42Updated 3 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆129Updated 3 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 4 months ago
- ☆50Updated 6 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 9 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆52Updated last week
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆36Updated 2 weeks ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆67Updated 7 months ago
- Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆29Updated last week