TheStageAI / TheWhisperLinks
Optimized Whisper models for streaming and on-device use
☆811Updated this week
Alternatives and similar repositories for TheWhisper
Users that are interested in TheWhisper are comparing it to the libraries listed below
Sorting:
- TTS model capable of streaming conversational audio in realtime.☆1,027Updated 2 months ago
- ☆502Updated this week
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆1,009Updated this week
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆336Updated this week
- ☆694Updated 4 months ago
- An open source web crawler that searches the internet☆247Updated 4 months ago
- AI writing agent powered by kimi-k2-thinking - autonomously creates novels and stories with deep reasoning☆520Updated 2 months ago
- Make text LLMs listen and speak☆1,133Updated last week
- In this repo there are some tips and a template to train your YOLO model for any kind of computer vision application☆334Updated 2 weeks ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,100Updated 2 weeks ago
- advanced, scalable, no-code RAG☆325Updated last week
- ☆637Updated 2 months ago
- ComfyDeployed☆439Updated 4 months ago
- An OS for your agents, built for your pocket.☆795Updated 3 months ago
- An open-source implementation of Whisper☆475Updated 3 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆373Updated 5 months ago
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- Open-source framework for developing real-time multimodal conversational AI agents.☆587Updated this week
- Diagram generation for understanding codebases and system architecture using Nano Banana Pro.☆575Updated 2 months ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,077Updated last month
- Nanobanana fal AI powered Photoshop-esque Studio☆333Updated 2 months ago
- Living memory for AI☆335Updated 3 weeks ago
- Semantic search and document parsing tools for the command line☆1,572Updated this week
- ☆431Updated last month
- AirLLM 70B inference with single 4GB GPU☆1,908Updated 4 months ago
- ☆320Updated 2 weeks ago
- ☆318Updated last month
- A personal AI assistant for everyone☆666Updated last week
- An agentic Machine Learning Engineer☆1,189Updated last month
- ☆967Updated last month