Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"
☆67May 4, 2026Updated 2 weeks ago
Alternatives and similar repositories for KnowU-Bench
Users who are interested in KnowU-Bench are comparing it to the libraries listed below.
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆54May 5, 2026Updated 2 weeks ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆265Updated this week
- ☆32Aug 11, 2025Updated 9 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 7 months ago
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆76Mar 9, 2026Updated 2 months ago
- ☆37Oct 9, 2025Updated 7 months ago
- [ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139☆79Nov 10, 2025Updated 6 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 11 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆45Aug 10, 2025Updated 9 months ago
- ☆28Aug 19, 2025Updated 9 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆418May 13, 2026Updated last week
- The official implementation of dLLM-Factory☆25Jul 12, 2025Updated 10 months ago
- Competitive Programming Code Template☆11Nov 6, 2022Updated 3 years ago
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Dec 7, 2024Updated last year
- ☆113Updated this week
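- Notes I made in Yuque while taking Discrete Mathematics and Its Applications in the 2022 spring-summer semester, now exported to GitHub for everyone to read, in the hope of saving junior students some note-taking time.☆12Mar 5, 2023Updated 3 years ago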
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆43Jan 28, 2026Updated 3 months ago
- Some small but useful scripts that help you with RK35588 or other Rockchips☆10May 17, 2023Updated 3 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆40Dec 31, 2024Updated last year
- ☆18May 11, 2025Updated last year
- Tool to convert and import problems from Polygon into DOMjudge.☆35Mar 19, 2025Updated last year
- Materials for Discrete Mathematics and Its Applications by Kenneth H. Rosen☆32Sep 18, 2022Updated 3 years ago
- ☆12Mar 13, 2025Updated last year
- ICLR 2026☆42May 12, 2026Updated last week
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆12Aug 1, 2023Updated 2 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆14Nov 1, 2025Updated 6 months ago
- GroundCUA☆126Mar 24, 2026Updated last month
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆98Oct 23, 2025Updated 6 months ago
- ☆15Jun 6, 2023Updated 2 years ago
- ☆32Jun 13, 2025Updated 11 months ago
- Collected problems and data☆32Aug 11, 2024Updated last year
- The official code of [ICLR 2026] TFPI: Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient …☆103Jan 27, 2026Updated 3 months ago
- Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)☆14Nov 22, 2023Updated 2 years ago
- 🐳 PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x.☆11Aug 29, 2021Updated 4 years ago
- Code for our paper "Enhancing Continual Relation Extraction via Classifier Decomposition" (Findings of ACL 2023)☆10Nov 29, 2023Updated 2 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- ☆15May 13, 2026Updated last week
- [TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue☆13Oct 18, 2025Updated 7 months ago