HKUDS / LightAgentLinks
"LightAgent: Lightweight and Cost-Effective Mobile Agents"
☆31Updated last week
Alternatives and similar repositories for LightAgent
Users that are interested in LightAgent are comparing it to the libraries listed below
Sorting:
- Official Repository of "GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration".☆39Updated 7 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆30Updated last week
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆72Updated last month
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆18Updated last year
- ☆23Updated 3 months ago
- ☆67Updated 7 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆33Updated 2 weeks ago
- ☆79Updated last year
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆190Updated 3 weeks ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆119Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- ☆19Updated 7 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 8 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆32Updated 3 weeks ago
- ☆55Updated 11 months ago
- ☆40Updated 5 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆40Updated 10 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆60Updated 3 months ago
- ☆11Updated 11 months ago
- Reproducible Language Agent Research☆29Updated 4 months ago
- [ICCV2025] WikiAutoGen offical page☆20Updated 4 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆52Updated 10 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆32Updated last month
- ☆50Updated last year
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation, arXiv 2024☆64Updated 2 weeks ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- XmodelLM☆38Updated 11 months ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Updated 10 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆29Updated last week
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆20Updated 7 months ago