Khalil-Rehman9 / CaptionAI
A powerful and user-friendly tool that generates detailed captions for your images
☆21Updated 6 months ago
Alternatives and similar repositories for CaptionAI
Users who are interested in CaptionAI are comparing it to the libraries listed below.
- ACE-Step: A Step Towards Music Generation Foundation Model☆40Updated 2 weeks ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆54Updated 7 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆21Updated 2 months ago
- win32 native frontend for llama-cli☆12Updated 7 months ago
- ☆16Updated last year
- Orpheus Chat WebUI☆62Updated 2 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆40Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆30Updated last month
- Create text chunks which end at natural stopping points without using a tokenizer☆24Updated 2 months ago
- Run Orpheus 3B Locally With LM Studio☆31Updated 2 months ago
- Extract voice segments of a target speaker from podcasts - Useful for creating speech datasets☆126Updated last week
- Win & Linux Gradio WebUI for CSM-1B model by sesame☆44Updated 2 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆50Updated 8 months ago
- Dou (道) - AI powered analysis and feedback for notes and mind maps☆28Updated last month
- ☆10Updated 2 months ago
- LLM backed Fantasy Tribe Game☆18Updated 6 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated 9 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆35Updated 2 weeks ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆44Updated 2 months ago
- Writing Extension for Text Generation WebUI☆55Updated 4 months ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆60Updated 9 months ago
- ☆25Updated 2 months ago
- Personal voice assistant, with voice interruption and Twilio support☆17Updated 3 months ago
- ☆26Updated 2 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 4 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆69Updated 7 months ago
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆74Updated 10 months ago
- Intuitive basic interface for interacting with multiple LLMs at the same time☆46Updated 3 weeks ago
- Simulates Twitch Chat with a locally hosted LLM☆16Updated 7 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year