fcakyon / pywhisperLinks
openai/whisper + extra features
☆89Updated 2 years ago
Alternatives and similar repositories for pywhisper
Users that are interested in pywhisper are comparing it to the libraries listed below
Sorting:
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆246Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- ☆158Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆234Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- openvino version of openai/whisper☆170Updated last year
- A testing repo to share code and thoughts on diarisation☆55Updated last year
- A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model☆15Updated 2 years ago
- Coqui AI TTS plugin☆85Updated last month
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- generate granular word-level captions in srt format☆57Updated 2 years ago
- ☆62Updated last year
- ☆359Updated last year
- ☆149Updated 2 years ago
- ☆83Updated last year
- ☆64Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 3 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- ☆70Updated 3 months ago
- ☆14Updated 2 years ago
- OpenAI Whisper + davinci for podcast summarization☆71Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Audio datasets, easier.☆84Updated last year
- llama-4bit-colab☆64Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago