ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!
☆140Sep 30, 2025Updated 5 months ago
Alternatives and similar repositories for screensuite
Users who are interested in screensuite are comparing it to the libraries listed below.
- Repository for opt-out requests.☆10Mar 25, 2024Updated last year
- Auto-video maker handling many AIs☆11Mar 18, 2024Updated last year
- Simple Chainlit UI for running LLMs from Groq and LangChain☆17Feb 28, 2024Updated 2 years ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Jul 27, 2023Updated 2 years ago
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 5 months ago
- A TypeScript demo using React and the Streaming Avatar SDK☆17Jul 2, 2024Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- Commander, your AI coding commander centre for all your AI coding CLI agents☆50Oct 10, 2025Updated 4 months ago
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Sep 27, 2023Updated 2 years ago
- A Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.☆36Oct 31, 2025Updated 4 months ago
- ☆123Apr 21, 2023Updated 2 years ago
- ☆31Jul 3, 2025Updated 8 months ago
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last month
- ☆20Apr 24, 2024Updated last year
- Translation of the StrategyQA dataset☆23Apr 12, 2024Updated last year
- ☆53Aug 5, 2025Updated 7 months ago
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆30Feb 27, 2026Updated last week
- Benchmarking Benchmark Leakage in Large Language Models☆60May 20, 2024Updated last year
- ☆25Jul 24, 2024Updated last year
- ☆63Dec 29, 2025Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Dec 10, 2024Updated last year
- The DPAB-α Benchmark☆32Jan 15, 2025Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆33Updated this week
- ☆40Jul 15, 2025Updated 7 months ago
- Repository for CPU Kernel Generation for LLM Inference☆28Jul 13, 2023Updated 2 years ago
- The theory of mind module for the SWE agent☆82Jan 13, 2026Updated last month
- Evaluation framework for document processing models and services.☆64Feb 12, 2026Updated 3 weeks ago
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆32Sep 20, 2024Updated last year
- Code and training scripts for FlexOlmo☆126Feb 27, 2026Updated last week
- cbReader - A simple web-based comic book reader (CBZ/CBR)☆10May 21, 2018Updated 7 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 5 months ago
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- Training and Benchmarking LLMs for Code Preference.☆38Nov 15, 2024Updated last year
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆35Oct 7, 2024Updated last year
- A simple Dify bot☆34Apr 16, 2025Updated 10 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago