showlab / macosworld
☆18 Updated 2 months ago
Alternatives and similar repositories for macosworld
Users who are interested in macosworld are comparing it to the repositories listed below.
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant ☆42 Updated 11 months ago
- The original Shared Recurrent Memory Transformer implementation ☆33 Updated 5 months ago
- ☆67 Updated 8 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆34 Updated 2 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ☆87 Updated 6 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25] ☆33 Updated 3 months ago
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆53 Updated last year
- ☆72 Updated 6 months ago
- ☆27 Updated 2 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner. ☆35 Updated last month
- ☆24 Updated last year
- ☆89 Updated last month
- Efficient Agent Training for Computer Use ☆133 Updated 3 months ago
- Resa: Transparent Reasoning Models via SAEs ☆45 Updated 2 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆85 Updated last week
- [ACL 2025 Findings] Benchmarking Multihop Multimodal Internet Agents ☆47 Updated 9 months ago
- ☆11 Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆136 Updated last year
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆70 Updated 6 months ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks ☆13 Updated 11 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆54 Updated last month
- ☆42 Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆38 Updated last year
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization ☆93 Updated 6 months ago
- Enhancement in Multimodal Representation Learning. ☆40 Updated last year
- Official repo of paper LM2 ☆46 Updated 10 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration" ☆127 Updated 3 months ago
- ☆63 Updated 5 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon! ☆51 Updated 8 months ago
- [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents" ☆93 Updated 2 months ago