Troyanovsky / llama-vision-image-taggerLinks
Use Llama3.2 Vision for tagging and searching images on your local machine.
☆74Updated 6 months ago
Alternatives and similar repositories for llama-vision-image-tagger
Users that are interested in llama-vision-image-tagger are comparing it to the libraries listed below
Sorting:
- Automated speech dataset creator☆159Updated last month
- Ollama client written in Python☆4Updated 7 months ago
- Creates an index of images, queries a local LLM and adds tags to the image metadata☆234Updated last month
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆136Updated 9 months ago
- Add AI capabilities to your file system using Ollama, Groq, OpenAi and other's api☆199Updated 6 months ago
- The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy.☆432Updated 2 weeks ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆192Updated 2 months ago
- ☆80Updated 4 months ago
- ☆72Updated 2 months ago
- Orpheus Chat WebUI☆69Updated 3 months ago
- Testing the different LLM and RAG Tests while I learn along the way☆197Updated last month
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆102Updated 3 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆258Updated 4 months ago
- A python script designed to translate large amounts of text with an LLM and the Ollama API☆103Updated 3 weeks ago
- MAESTRO is an AI-powered research application designed to streamline complex research tasks.☆164Updated last month
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆57Updated 5 months ago
- AI Powered search tool offers content-based, text, and visual similarity system-wide search.☆256Updated last month
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆246Updated 5 months ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆28Updated 4 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆232Updated 6 months ago
- Easy to use interface for the Whisper model optimized for all GPUs!☆241Updated 3 weeks ago
- Modified version of Chatterbox that accepts text files as input and no character restrictions☆330Updated 3 weeks ago
- ☆186Updated 3 months ago
- Code for Papeg.ai☆225Updated 6 months ago
- Agent MCP for ffmpeg☆196Updated last month
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆293Updated last month
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆195Updated 3 months ago
- ☆108Updated 2 months ago
- Using Gemma-3 Vision☆93Updated 3 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆173Updated this week