ThanabordeeN / Screenshot_LLM
Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images.
☆40Updated 5 months ago
Alternatives and similar repositories for Screenshot_LLM:
Users that are interested in Screenshot_LLM are comparing it to the libraries listed below
- ☆22Updated 8 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- An API for VoiceCraft.☆25Updated 9 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆30Updated this week
- Personal voice assistant, with voice interruption and Twilio support☆17Updated last month
- ☆24Updated 2 months ago
- ☆17Updated 4 months ago
- Allows two LLMs to communicate and run code in the terminal☆23Updated 4 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆38Updated 7 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆44Updated last year
- A unified library for interacting with various AI APIs through a standardized interface.☆29Updated last month
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆117Updated 5 months ago
- Use smol agents to do research and then update csv coumns with its findings.☆37Updated 2 months ago
- Crow is a Desktop AI Assistant☆32Updated 8 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated last month
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆29Updated last month
- ☆29Updated 6 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆35Updated last week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆65Updated 5 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 9 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated 9 months ago
- OLLama IMage CAtegorizer☆66Updated 3 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 7 months ago
- ☆68Updated last month
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated 10 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆22Updated 2 weeks ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆18Updated last month
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆33Updated 6 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 7 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆55Updated last month