Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"
☆51Apr 10, 2026Updated this week
Alternatives and similar repositories for KnowU-Bench
Users that are interested in KnowU-Bench are comparing it to the libraries listed below.
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆51Feb 12, 2026Updated 2 months ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆97Apr 3, 2026Updated last week
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆63Nov 8, 2025Updated 5 months ago
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆72Mar 9, 2026Updated last month
- ☆37Oct 9, 2025Updated 6 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 5 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 10 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 6 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆43Aug 10, 2025Updated 8 months ago
- ☆28Aug 19, 2025Updated 7 months ago
- The official implementation of dLLM-Factory☆25Jul 12, 2025Updated 8 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆394Updated this week
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Dec 7, 2024Updated last year
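- Notes I took in Yuque while studying Discrete Mathematics and Its Applications in the 2022 spring-summer semester, now exported to GitHub for everyone to read, in the hope of saving junior students some note-taking time.☆12Mar 5, 2023Updated 3 years ago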
- MCM/ICM crawler: scrapes certificates from the Mathematical Contest in Modeling (MCM/ICM) and analyzes their information via OCR☆17Jun 25, 2022Updated 3 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆40Dec 31, 2024Updated last year
- ☆18May 11, 2025Updated 11 months ago
- ☆39Aug 28, 2025Updated 7 months ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆97Oct 23, 2025Updated 5 months ago
- The official code of [ICLR 2026] TFPI: Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient …☆103Jan 27, 2026Updated 2 months ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.☆125Apr 1, 2026Updated last week
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆305Feb 2, 2026Updated 2 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]☆76Dec 17, 2025Updated 3 months ago
- Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Lear…☆49Jun 11, 2025Updated 10 months ago
- ☆129Oct 3, 2025Updated 6 months ago
- Opens the current xcworkspace / xcproject in AppCode.☆38Sep 22, 2016Updated 9 years ago
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated 10 months ago
- ☆10Aug 7, 2024Updated last year
- Media around Buildbot - images, slides, papers, etc.☆14Oct 6, 2019Updated 6 years ago
- ☆16Oct 21, 2022Updated 3 years ago
- ☆19Oct 22, 2025Updated 5 months ago
- Kidash: A GrimoireLab tool & library to manage Kibana/Kibiter visualizations and dashboards☆13Mar 3, 2026Updated last month
- Easy and Efficient dLLM Fine-Tuning☆239Mar 2, 2026Updated last month
- A collection of development container 'features' for machine learning and data science☆11Feb 19, 2026Updated last month
- Software for building the IR Anthology.☆11Sep 19, 2023Updated 2 years ago
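- An open-source math LLM project that aims to explore whether large models possess mathematical creativity, and their potential capabilities in frontier mathematical research.☆18Mar 19, 2026Updated 3 weeks ago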
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆123Oct 16, 2025Updated 5 months ago
- ☆13Jan 12, 2026Updated 2 months ago
- A Unified Framework for High-Performance and Extensible LLM Steering☆230Apr 1, 2026Updated last week