ThanabordeeN / Screenshot_LLMLinks
Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images.
☆42Updated 7 months ago
Alternatives and similar repositories for Screenshot_LLM
Users that are interested in Screenshot_LLM are comparing it to the libraries listed below
Sorting:
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 9 months ago
- Personal voice assistant, with voice interruption and Twilio support☆17Updated 4 months ago
- Dou (道) - AI powered analysis and feedback for notes and mind maps☆28Updated 2 months ago
- Use smol agents to do research and then update csv coumns with its findings.☆41Updated 4 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆31Updated 3 months ago
- ☆29Updated 8 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Updated 2 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆23Updated last month
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆26Updated 2 weeks ago
- Python language chat with Ollama models locally, anthropic and openai☆25Updated 2 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆36Updated 3 months ago
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.☆48Updated this week
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆26Updated 4 months ago
- Fast local speech-to-text for any app using faster-whisper☆74Updated 2 months ago
- An API for VoiceCraft.☆25Updated 11 months ago
- Crow is a Desktop AI Assistant☆31Updated 10 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆25Updated 3 months ago
- Capture, tag, and search images locally with OSS models.☆42Updated 5 months ago
- Finally, an open source Youtube Summarizer extension☆73Updated 2 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆66Updated 7 months ago
- A micro LLM multi-agent system for data analysis☆19Updated last month
- A real-time shared memory layer for multi-agent LLM systems.☆21Updated last week
- ☆24Updated 5 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 8 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆32Updated last week
- ☆19Updated 8 months ago
- ☆22Updated 10 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated last year
- LLM backed Fantasy Tribe Game☆18Updated 7 months ago
- Self-hosted AI medical scribe.☆44Updated this week