ThanabordeeN / Screenshot_LLM
Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images.
☆41Updated 6 months ago
Alternatives and similar repositories for Screenshot_LLM:
Users that are interested in Screenshot_LLM are comparing it to the libraries listed below
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- Personal voice assistant, with voice interruption and Twilio support☆17Updated 2 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆19Updated last week
- ☆28Updated 7 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆65Updated 6 months ago
- ☆75Updated 2 months ago
- An API for VoiceCraft.☆25Updated 10 months ago
- Fast local speech-to-text for any app using faster-whisper☆68Updated last month
- Crow is a Desktop AI Assistant☆32Updated 8 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.☆20Updated this week
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆25Updated last month
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆26Updated 2 months ago
- Use smol agents to do research and then update csv coumns with its findings.☆39Updated 3 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆29Updated last month
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆29Updated 2 months ago
- ☆17Updated 4 months ago
- Enable tool/function calling for any LLM, in OpenAI and Ollama API formats, adding universal function calling to models without native su…☆21Updated this week
- ☆16Updated this week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆28Updated 2 months ago
- ☆19Updated 7 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆39Updated 8 months ago
- ☆29Updated 2 weeks ago
- Analyze Reddit posts☆26Updated 2 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆58Updated this week
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 6 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 8 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆35Updated 2 weeks ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆19Updated 3 weeks ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆20Updated last month
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆30Updated 2 months ago