Vaibhavs10 / translate-with-whisperLinks
☆158Updated 2 years ago
Alternatives and similar repositories for translate-with-whisper
Users that are interested in translate-with-whisper are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆371Updated last year
- ☆62Updated last year
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- ☆262Updated last year
- ☆307Updated last year
- Speaker Diarization with Transformers☆69Updated 2 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- Open TTS models, built for streaming on the edge☆43Updated 5 months ago
- ☆359Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated last year
- ☆206Updated last year
- ☆127Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 9 months ago
- Collection of Open Source Speech Data☆159Updated 9 months ago
- openai/whisper + extra features☆89Updated 2 years ago
- whisper.cpp bindings for python☆101Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆99Updated 2 months ago
- A testing repo to share code and thoughts on diarisation☆56Updated last year
- ☆273Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 9 months ago
- openvino version of openai/whisper☆174Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLX☆172Updated 3 weeks ago
- Open-source reproducible benchmarks from Argmax☆53Updated this week
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated last year