plaggy / fast-whisper-server
ASR + diarization model server with speculative decoding
☆63 Updated last year
Alternatives and similar repositories for fast-whisper-server
Users interested in fast-whisper-server are comparing it to the libraries listed below
- Whisper realtime streaming for long speech-to-text transcription and translation ☆121 Updated last year
- Scripts to create your own moe models using mlx ☆90 Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆65 Updated last year
- ☆170 Updated last year
- ☆174 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆139 Updated last year
- ☆102 Updated last year
- ☆31 Updated last year
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- All the world is a play, we are but actors in it. ☆50 Updated 3 months ago
- ☆207 Updated last year
- Kyutai with an "eye" ☆223 Updated 7 months ago
- Cog wrapper for collabora/WhisperSpeech ☆24 Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ☆123 Updated last year
- Let's create synthetic textbooks together :) ☆75 Updated last year
- ☆116 Updated 11 months ago
- Own your AI, search the web with it🌐😎 ☆92 Updated 10 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆113 Updated 7 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆217 Updated last year
- ☆36 Updated 2 years ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc… ☆23 Updated last year
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text) ☆245 Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆102 Updated 10 months ago
- huggingface chat-ui integration with mlx-lm server ☆61 Updated last year
- One Line To Build Zero-Data Classifiers in Minutes ☆62 Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper. ☆29 Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 Updated last year
- VideoDB Python SDK ☆84 Updated last month