tsinghua-fib-lab/SmartAgent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tsinghua-fib-lab/SmartAgent)

tsinghua-fib-lab / SmartAgent

The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".

☆27

Alternatives and similar repositories for SmartAgent

Users that are interested in SmartAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UCSB-AI / Screen-Point-and-Read
View on GitHub
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
☆31May 12, 2026Updated 2 months ago
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
xbmxb / EnvDistraction
View on GitHub
☆24Oct 11, 2024Updated last year
junchen-fu / DIGER
View on GitHub
Differentiable Semantic ID for Generative Recommendation
☆47Jun 8, 2026Updated last month
921112343 / GUI-Xplore
View on GitHub
[CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
☆21Mar 21, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
langfengQ / CoSo
View on GitHub
Official code for paper "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"
☆15Jun 12, 2025Updated last year
yuzhu-cai / rSDE-Bench
View on GitHub
☆36May 29, 2025Updated last year
THUDM / Self-Contrast
View on GitHub
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
☆20Apr 2, 2024Updated 2 years ago
OSU-NLP-Group / Explorer
View on GitHub
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆29Feb 17, 2026Updated 5 months ago
hkust-nlp / GUIMid
View on GitHub
☆22May 3, 2025Updated last year
SongYYYY / KDD22-OODGAT
View on GitHub
This is the implementation of OODGAT from KDD'22: Learning on Graphs with Out-of-Distribution Nodes.
☆23Sep 2, 2022Updated 3 years ago
longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
intelligolabs / CoIN
View on GitHub
[ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dial…
☆31Apr 1, 2026Updated 3 months ago
alenai97 / PEFT-MLLM
View on GitHub
Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"
☆25Nov 10, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
google-research-datasets / screen2words
View on GitHub
The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of …
☆67Jul 27, 2021Updated 5 years ago
Cranial-XIX / marl-copa
View on GitHub
PyTorch Implementation of COPA for coordinating teams that can dynamically change.
☆24Apr 16, 2022Updated 4 years ago
declare-lab / Emma-X
View on GitHub
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
☆84May 17, 2025Updated last year
SceneDroid / SceneDroid
View on GitHub
☆17Oct 30, 2023Updated 2 years ago
cambridgeltl / topviewrs
View on GitHub
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)
☆15Jun 14, 2025Updated last year
martian422 / MaskGRPO
View on GitHub
The official implementation of MaskGRPO: Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models. (ICLR 2026, arxiv…
☆19Jan 27, 2026Updated 6 months ago
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
aaron-wheeler / MarketGPT
View on GitHub
MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series
☆19Sep 5, 2025Updated 10 months ago
InfiXAI / InfiGUI-R1
View on GitHub
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67Dec 4, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
Neon-Jing / Guider
View on GitHub
[WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…
☆14Oct 14, 2025Updated 9 months ago
fffstrong / RoboWM-Bench
View on GitHub
☆37Updated this week
Fu-Dayuan / PreAct
View on GitHub
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆31Dec 12, 2024Updated last year
Ganvin-Li / AlldayWalker
View on GitHub
🎉 [ICLR 2026] All-Day Multi-Scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation
☆37Jun 29, 2026Updated last month
VincentDENGP / 3D-LR
View on GitHub
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20Mar 28, 2024Updated 2 years ago
robiemusketeer / faea-sim
View on GitHub
☆18Jan 29, 2026Updated 6 months ago
PhoneHarness / PhoneHarness
View on GitHub
PhoneHarness runtime harness for mixed-action phone agents
☆35Jun 17, 2026Updated last month
yulin-luo / RoboBench
View on GitHub
This is the official evaluation code for Robobench
☆22Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
LCS2-IIITD / DaSLaM
View on GitHub
☆17Oct 31, 2023Updated 2 years ago
zs1314 / Fraesormer
View on GitHub
【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"
☆13Mar 21, 2025Updated last year
McGill-NLP / agent-reward-bench
View on GitHub
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆48Aug 7, 2025Updated 11 months ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
Ten-Mao / DiscRec
View on GitHub
The implementation for the work "DiscRec: Disentangled Semantic–Collaborative Modeling for Generative Recommendation".
☆16Jul 13, 2025Updated last year
Liuxinyv / HiPrompt
View on GitHub
[IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts
☆26Feb 28, 2025Updated last year
UMass-Embodied-AGI / CHAIC
View on GitHub
[NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…
☆25May 2, 2025Updated last year