plaggy / fast-whisper-serverLinks
ASR + diarization model server with speculative decoding
☆63Updated last year
Alternatives and similar repositories for fast-whisper-server
Users that are interested in fast-whisper-server are comparing it to the libraries listed below
Sorting:
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- ☆175Updated 2 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆141Updated last year
- ☆206Updated last year
- All the world is a play, we are but actors in it.☆49Updated 6 months ago
- ☆92Updated 11 months ago
- ☆119Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Kyutai with an "eye"☆234Updated 9 months ago
- ☆170Updated last year
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆216Updated 2 months ago
- ☆157Updated 2 years ago
- huggingface chat-ui integration with mlx-lm server☆62Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆77Updated last year
- Joint speech-language model - respond directly to audio!☆371Updated last year
- ☆67Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- ☆37Updated 2 years ago
- ☆101Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated last year
- VideoDB Python SDK☆87Updated this week
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Updated 2 years ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆68Updated 2 years ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆123Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Let's create synthetic textbooks together :)☆75Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆86Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year