AutoLab-SAI-SJTU / AutoPageLinks
This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.
☆155Updated 3 months ago
Alternatives and similar repositories for AutoPage
Users that are interested in AutoPage are comparing it to the libraries listed below
Sorting:
- ☆507Updated last week
- Official Repository for PosterGen☆209Updated 3 weeks ago
- This repository collects and organises state‑of‑the‑art papers on spatial reasoning for Multimodal Vision–Language Models (MVLMs).☆270Updated last week
- A Scientific Multimodal Foundation Model☆629Updated 4 months ago
- Official repository for the paper "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" and the SciEvo benchmark.☆47Updated 3 weeks ago
- OpenCUA: Open Foundations for Computer-Use Agents☆661Updated 2 weeks ago
- A reproduction of the Deepseek-OCR model including training☆206Updated 2 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆558Updated 3 months ago
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.☆535Updated 2 months ago
- ☆82Updated last month
- AgentEvolver: Towards Efficient Self-Evolving Agent System☆1,109Updated last week
- The paper list of "Memory in the Age of AI Agents: A Survey"☆1,078Updated last week
- ☆112Updated 3 months ago
- ☆137Updated 2 weeks ago
- Fully Open Framework for Democratized Multimodal Training☆710Updated last month
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆376Updated 3 months ago
- ☆849Updated 3 months ago
- Survey and paper list on efficiency-guided LLM agents (memory, tool learning, planning).☆122Updated this week
- [ACL 2025] GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent☆58Updated 8 months ago
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆349Updated 8 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆255Updated 2 months ago
- MemEvolve & EvolveLab☆148Updated last month
- ☆1,222Updated 3 months ago
- The development and future prospects of large multimodal reasoning models.☆582Updated 3 weeks ago
- AgentFlow: In-the-Flow Agentic System Optimization☆1,543Updated last week
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆577Updated 4 months ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆311Updated last week
- TPAMI 2026 | This repository collects awesome survey, resource, and paper for lifelong learning LLM agents☆268Updated 3 weeks ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆250Updated 4 months ago
- [WWW 2026] 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets☆958Updated 3 weeks ago