ai-dock / base-imageLinks
Base image to be extended by all other ai-dock images
☆33Updated last year
Alternatives and similar repositories for base-image
Users that are interested in base-image are comparing it to the libraries listed below
Sorting:
- Eternal is an experimental platform for machine learning models and workflows.☆67Updated 7 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 6 months ago
- An API for VoiceCraft.☆25Updated last year
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian…☆66Updated 6 months ago
- API server for Instant voice cloning by MyShell.☆103Updated last year
- deep hermes, but decides how to respond based on its OWN decision, no need for system prompts.☆37Updated 6 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆44Updated 4 months ago
- AI Media processing using ComfyUI☆158Updated this week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆117Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer☆25Updated 6 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated 11 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆259Updated 7 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 11 months ago
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping☆30Updated 6 months ago
- ☆24Updated 8 months ago
- ☆132Updated 5 months ago
- automatically quant GGUF models☆210Updated last week
- ☆51Updated 11 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆57Updated 10 months ago
- ☆91Updated 4 months ago
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆33Updated last year
- SoTA open-source TTS☆100Updated 3 weeks ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated 11 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆48Updated 3 months ago
- An OpenAI API compatible images server to generate or manipulate images.☆17Updated 8 months ago
- A open webui function for better R1 experience☆78Updated 7 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 5 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆137Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆45Updated 8 months ago