[ICLR 2026] VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
☆87Feb 22, 2026Updated last week
Alternatives and similar repositories for vitabench
Users that are interested in vitabench are comparing it to the libraries listed below
Sorting:
- C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking☆37Updated this week
- ☆17Aug 5, 2025Updated 6 months ago
- 어린이를 위한 동화 제작 서비스, My AI Fairy-Tale☆11Apr 7, 2023Updated 2 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- ☆29Jan 15, 2026Updated last month
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated last year
- 🐰Easy resolving deep json using keypath in Dart☆12Mar 30, 2021Updated 4 years ago
- CustomDynamicType is a versatile Swift library designed to seamlessly integrate custom fonts into iOS Dynamic Type.☆12Dec 5, 2023Updated 2 years ago
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 10 months ago
- Get the element at the specified index only if it is within bounds, otherwise nil☆12Jun 5, 2019Updated 6 years ago
- ☆12Feb 27, 2025Updated last year
- A proof-of-concept implementation of suspend time memory encryption.☆10Feb 26, 2020Updated 6 years ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 2 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated last month
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Paper Reading Summary(mainly NLP related papers)☆11Nov 6, 2019Updated 6 years ago
- ☆11Feb 19, 2026Updated last week
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- 本项目是July的《程序员编程艺术》的电子书版本☆11Jan 9, 2014Updated 12 years ago
- List of open source projects that are made by Vietnamese engineers☆11Jan 20, 2016Updated 10 years ago
- Swift Implementation of the Model Context Protocol (MCP) Spec☆10Mar 28, 2025Updated 11 months ago
- Implementation of AdaCQR(COLING 2025)☆13Dec 30, 2024Updated last year
- Stanford CS224W: Machine Learning with Graphs (GNN)☆11Sep 6, 2022Updated 3 years ago
- ☆20Dec 3, 2025Updated 3 months ago
- Past and future talks☆12Jan 18, 2021Updated 5 years ago
- Useful Alfred workflows to enhance your productivity on Mac. Great for developers and power user.☆14Mar 22, 2020Updated 5 years ago
- Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, R…☆10Jan 29, 2026Updated last month
- Self Evolving Large Multimodal Models with Continuous Rewards☆19Nov 21, 2025Updated 3 months ago
- ☆16Oct 11, 2025Updated 4 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆39Feb 25, 2026Updated last week
- secureblue's static website☆18Updated this week
- Wireguard linux client (command line and gui)☆14Jan 10, 2026Updated last month
- ☆20Feb 5, 2026Updated 3 weeks ago
- The new software behind openSUSE Paste☆22Oct 2, 2025Updated 5 months ago
- ☆30Feb 19, 2026Updated last week
- Repository for opt-out requests.☆10Mar 25, 2024Updated last year
- Kubernetes Gateway API implementation in Rust☆23Updated this week
- ☆10Nov 15, 2020Updated 5 years ago
- xcconfig file parsing and evaluation☆16Aug 22, 2025Updated 6 months ago