☆23Oct 11, 2024Updated last year
Alternatives and similar repositories for EnvDistraction
Users that are interested in EnvDistraction are comparing it to the libraries listed below
Sorting:
- ☆31Sep 27, 2024Updated last year
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆99Jan 11, 2026Updated last month
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Jul 31, 2024Updated last year
- [AAAI2025 Oral] BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking☆13Apr 22, 2025Updated 10 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 7 months ago
- ☆37Oct 15, 2024Updated last year
- [ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"☆18Jun 19, 2025Updated 8 months ago
- ☆70Feb 4, 2024Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆255Jul 16, 2024Updated last year
- ☆35Jun 20, 2024Updated last year
- [WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering☆15Apr 22, 2025Updated 10 months ago
- ☆17Sep 4, 2024Updated last year
- ☆18Aug 15, 2022Updated 3 years ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- ☆20Feb 11, 2024Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- The source code of Paper "PathQG: Neural Question Generation from Facts".☆23Jan 4, 2021Updated 5 years ago
- VisionDroid☆21Apr 2, 2024Updated last year
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian☆23May 4, 2023Updated 2 years ago
- VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat…☆102Jul 18, 2025Updated 7 months ago
- ☆19Mar 9, 2024Updated last year
- SLIME is a novel program-sensitive fuzzer that designs multiple property-aware queues and leverages a customized Upper Confidence Bound V…☆20Feb 23, 2023Updated 3 years ago
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)☆24Dec 9, 2021Updated 4 years ago
- [ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification☆42Jan 21, 2026Updated last month
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Aug 9, 2024Updated last year
- ☆37Oct 17, 2024Updated last year
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Oct 6, 2023Updated 2 years ago
- ☆10Dec 21, 2022Updated 3 years ago
- FPGA Low latency 10GBASE-R PCS☆12May 23, 2023Updated 2 years ago
- Official code repo for the paper "LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark"☆46May 16, 2025Updated 9 months ago
- [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models☆41Jun 4, 2024Updated last year
- Structured Chemistry Reasoning with Large Language Models☆39May 4, 2024Updated last year
- ☆10Jun 5, 2023Updated 2 years ago
- AI-WordCards is an innovative project that leverages the power of GPT, StableDiffusion, and DALL-E3 to create educational and engaging wo…☆10May 16, 2024Updated last year
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Updated this week
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios☆19Updated this week