ThanabordeeN / Screenshot_LLM
Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images.
☆42Updated 7 months ago
Alternatives and similar repositories for Screenshot_LLM
Users who are interested in Screenshot_LLM are comparing it to the libraries listed below.
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 8 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆39Updated 9 months ago
- Crow is a Desktop AI Assistant☆32Updated 9 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 8 months ago
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆25Updated this week
- Open source tool for transcription and subtitling, alternative to happyscribe.☆26Updated 3 months ago
- Personal voice assistant, with voice interruption and Twilio support☆17Updated 3 months ago
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script).☆52Updated 7 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆24Updated 3 weeks ago
- Create text chunks which end at natural stopping points without using a tokenizer☆25Updated 2 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆36Updated last month
- Retrieval-augmented generation (RAG) for remote & local LLM use☆44Updated last week
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆29Updated 2 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆48Updated 3 months ago
- Capture, tag, and search images locally with OSS models.☆41Updated 4 months ago
- A micro LLM multi-agent system for data analysis☆18Updated last month
- An API for VoiceCraft.☆25Updated 11 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆29Updated 2 months ago
- Use smol agents to do research and then update CSV columns with its findings.☆40Updated 4 months ago
- ☆17Updated 5 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆30Updated this week
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆21Updated 2 months ago
- Allows two LLMs to communicate and run code in the terminal☆24Updated 5 months ago
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆34Updated 7 months ago
- ☆24Updated 4 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆30Updated 3 months ago
- ☆29Updated last month
- High level tool use for LLMs☆34Updated 10 months ago
- LLM backed Fantasy Tribe Game☆18Updated 6 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆30Updated last month