X-PLUG / OSWorld-MCPLinks
☆207Updated last month
Alternatives and similar repositories for OSWorld-MCP
Users that are interested in OSWorld-MCP are comparing it to the libraries listed below
Sorting:
- Marco Search Agent for Realistic and Challenging Agentic Search☆240Updated 3 months ago
- ☆110Updated 2 weeks ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆244Updated 4 months ago
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆97Updated 3 weeks ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 10 months ago
- ☆198Updated 3 months ago
- ☆359Updated 7 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆236Updated 5 months ago
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆145Updated last month
- ☆128Updated 3 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated last month
- ☆207Updated 8 months ago
- ☆104Updated 7 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆52Updated last year
- A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…☆344Updated last month
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆311Updated last month
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆117Updated 3 weeks ago
- ☆33Updated 2 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆164Updated 2 months ago
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆128Updated 2 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆170Updated last year
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆54Updated 9 months ago
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://openreview.net/pdf?id=SlRtFwBdzP☆163Updated 4 months ago
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆98Updated 10 months ago
- ☆38Updated 9 months ago
- NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents☆135Updated last month
- Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate (NeurIPS 2024)☆33Updated last year
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆176Updated 6 months ago
- Official Implementation of FastMCTS: A Simple Sampling Strategy for Data Synthesis☆112Updated 6 months ago
- [ACL 2025] FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation☆62Updated 7 months ago