AutoLab-SAI-SJTU / AutoPageLinks
This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.
☆150Updated last month
Alternatives and similar repositories for AutoPage
Users that are interested in AutoPage are comparing it to the libraries listed below
Sorting:
- ☆449Updated this week
- Official Repository for PosterGen☆201Updated 3 weeks ago
- A Scientific Multimodal Foundation Model☆620Updated 2 months ago
- OpenCUA: Open Foundations for Computer-Use Agents☆608Updated last week
- This repository collects and organises state‑of‑the‑art papers on spatial reasoning for Multimodal Vision–Language Models (MVLMs).☆249Updated last week
- The paper list of "Memory in the Age of AI Agents: A Survey"☆507Updated this week
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.☆515Updated 3 weeks ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆539Updated last month
- 🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal rei…☆191Updated 2 weeks ago
- A reproduction of the Deepseek-OCR model including training☆200Updated last month
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆363Updated last month
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆246Updated 2 months ago
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆335Updated 6 months ago
- AgentFlow: In-the-Flow Agentic System Optimization☆1,425Updated last week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆508Updated 3 months ago
- ☆759Updated 2 months ago
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets☆874Updated last month
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆245Updated 5 months ago
- A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)☆420Updated 2 months ago
- [EMNLP 2025] Awesome RAG Reasoning Resources☆369Updated 5 months ago
- [ACL 2025] GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent☆57Updated 6 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆710Updated last week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆476Updated last month
- ☆1,164Updated 2 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆537Updated 3 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆368Updated 3 months ago
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆48Updated 3 weeks ago
- Fully Open Framework for Democratized Multimodal Training☆662Updated last week
- The development and future prospects of large multimodal reasoning models.☆561Updated 4 months ago
- "OpenPhone: Mobile Agentic Foundation Models for AI Phone"☆353Updated last week