lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―
β734Updated 2 weeks ago
Related projects β
Alternatives and complementary repositories for OmniSenseVoice
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkitβ718Updated 3 months ago
- first base model for full-duplex conversational audioβ1,560Updated last week
- Interface for OuteTTS models.β406Updated 2 weeks ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β695Updated last month
- Whisper with Medusa headsβ799Updated 2 weeks ago
- β446Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,547Updated 3 months ago
- Open source inference code for Rev's modelβ333Updated this week
- Local SRT/LLM/TTS Voicechatβ544Updated last month
- Visualise your CSV files in seconds without sending your data anywhereβ430Updated last week
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.β584Updated 6 months ago
- Implementation of F5-TTS in MLXβ327Updated 2 weeks ago
- Local realtime voice AIβ1,946Updated this week
- An API to transcribe audio with OpenAI's Whisper Large v3!β191Updated last week
- A fast multimodal LLM for real-time voiceβ1,339Updated this week
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.β222Updated 2 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.β487Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ262Updated 2 months ago
- Official implementation of the paper "Watermark Anything with Localized Messages"β789Updated this week
- Open-source framework for exporting and building applications off of your personal data.β938Updated this week
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)β580Updated last month
- Semantic Image Search CLI tool.β525Updated 2 months ago
- Detect whether or not an audio file was generated by NotebookLMβ120Updated 3 weeks ago
- StreamSpeech is an βAll in Oneβ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.β958Updated 2 months ago
- Real-time audio to chords, lyrics, beat, and melody.β668Updated 3 months ago
- Create mind maps to learn new things using AI.β478Updated 2 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,183Updated this week
- turnkey self-hosted offline transcription and diarization service with llm summaryβ738Updated last month
- π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking libraryβ1,415Updated this week
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformerβ238Updated last week