plaggy / fast-whisper-serverLinks
ASR + diarization model server with speculative decoding
☆61Updated last year
Alternatives and similar repositories for fast-whisper-server
Users that are interested in fast-whisper-server are comparing it to the libraries listed below
Sorting:
- ☆175Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- huggingface chat-ui integration with mlx-lm server☆60Updated last year
- ☆101Updated 10 months ago
- VideoDB Python SDK☆74Updated this week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆134Updated last year
- ☆87Updated 5 months ago
- All the world is a play, we are but actors in it.☆50Updated this week
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆26Updated 4 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆212Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- ☆74Updated last year
- ☆171Updated 11 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- ☆158Updated 2 years ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆22Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆121Updated last year
- Kyutai with an "eye"☆207Updated 3 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆62Updated 10 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆75Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆88Updated 2 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆63Updated 9 months ago
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆240Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Joint speech-language model - respond directly to audio!☆371Updated last year
- ☆205Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆108Updated 3 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆100Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 10 months ago