ThanabordeeN / Screenshot_LLMView external linksLinks
Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images.
☆45Nov 4, 2024Updated last year
Alternatives and similar repositories for Screenshot_LLM
Users that are interested in Screenshot_LLM are comparing it to the libraries listed below
Sorting:
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- A password validation and generation tool kit☆14Jan 7, 2023Updated 3 years ago
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- A powerful and user-friendly tool that generates detailed captions for your images☆21Nov 11, 2024Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Feb 5, 2026Updated last week
- ☆24Jan 22, 2025Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated 3 weeks ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Updated this week
- Capturing Screen Content In MacOS Apple sample code☆18Apr 17, 2024Updated last year
- Setup an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year
- ☆15Apr 9, 2025Updated 10 months ago
- Find your favorite Windows spotlight images☆23Jun 29, 2025Updated 7 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 2 months ago
- Local Llama project, L³ is an electron app that runs llama 3 models locally☆17Jun 5, 2025Updated 8 months ago
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 6 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated 10 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- NLP-based Contract Analysis☆12Sep 21, 2017Updated 8 years ago
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language api (LLM). It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- Collaborative AI Model☆11Nov 27, 2024Updated last year
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 6 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆27Dec 29, 2025Updated last month
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆33Oct 3, 2024Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Oct 21, 2025Updated 3 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆39Jan 27, 2026Updated 2 weeks ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆13Jul 1, 2025Updated 7 months ago
- Crow is a Desktop AI Assistant☆32Aug 9, 2024Updated last year
- Brief is a GTK4 application for browsing tldr-pages (community-maintained command line help pages).☆29Feb 2, 2026Updated last week
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- Open source Speechify alternative. Read PDFs and EPUBs with local models.☆35Nov 14, 2025Updated 2 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 7 months ago
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 4 months ago
- ☆83Feb 28, 2025Updated 11 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12May 30, 2025Updated 8 months ago
- Stable Diffusion 3.0 beta Generation GUI for image generation process and automatic save images.☆14Apr 18, 2024Updated last year
- ☆18Aug 19, 2025Updated 5 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year