Vaibhavs10 / translate-with-whisper
☆153 Updated last year
Alternatives and similar repositories for translate-with-whisper:
Users who are interested in translate-with-whisper are comparing it to the libraries listed below.
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆91 Updated 8 months ago
- ☆269 Updated 7 months ago
- whisper.cpp bindings for python ☆85 Updated last year
- ☆255 Updated 10 months ago
- ☆62 Updated 6 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆114 Updated last week
- A ggml (C++) re-implementation of tortoise-tts ☆175 Updated 5 months ago
- Speaker Diarization with Transformers ☆64 Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch ☆65 Updated 7 months ago
- ☆65 Updated 2 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year
- ☆348 Updated 10 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆140 Updated last year
- Collection of Open Source Speech Data ☆151 Updated 2 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆78 Updated last month
- ☆334 Updated 4 months ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 8 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆203 Updated 3 months ago
- Repository contains code to fine-tune WhisperASR model ☆23 Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆225 Updated 2 months ago
- Finetune VITS and MMS using HuggingFace's tools ☆131 Updated 9 months ago
- ASR + diarization model server with speculative decoding ☆53 Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆108 Updated last year
- Joint speech-language model - respond directly to audio! ☆365 Updated 6 months ago
- Create an LJSpeech structured voice dataset on wave input ☆24 Updated 4 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆158 Updated 10 months ago
- A testing repo to share code and thoughts on diarisation ☆53 Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆53 Updated last week
- ☆255 Updated 7 months ago
- ☆90 Updated 9 months ago