SamsungLabs / TinyClick
TinyClick: Single-Turn Agent for Empowering GUI Automation
☆29 Updated 4 months ago
Alternatives and similar repositories for TinyClick:
Users that are interested in TinyClick are comparing it to the libraries listed below
- ☆36 Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆89 Updated 3 weeks ago
- ☆51 Updated 6 months ago
- ☆53 Updated 8 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 Updated 8 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface ☆91 Updated 7 months ago
- Code for ScribeAgent paper ☆50 Updated last month
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆110 Updated 7 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆21 Updated 2 months ago
- ☆24 Updated 3 weeks ago
- LLM reads a paper and produce a working prototype ☆48 Updated 2 weeks ago
- ☆70 Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆46 Updated last month
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆141 Updated last week
- Own your AI, search the web with it🌐😎 ☆79 Updated last month
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications … ☆110 Updated 9 months ago
- An AI agent for interacting with a computer using the graphical user interface ☆75 Updated last year
- ☆111 Updated 2 months ago
- Simple examples using Argilla tools to build AI ☆53 Updated 3 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration. ☆98 Updated 2 months ago
- Open Agent Computer Interface ☆57 Updated 2 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆111 Updated 3 months ago
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model. ☆85 Updated 3 weeks ago
- Official Code for Oᴘᴇɴ-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024) ☆100 Updated 3 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents ☆168 Updated this week
- GPT-4 Level Conversational QA Trained In a Few Hours ☆58 Updated 6 months ago
- ☆114 Updated 6 months ago