matallanas / whisper_gpt_pipelineLinks
A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model
☆15Updated 2 years ago
Alternatives and similar repositories for whisper_gpt_pipeline
Users that are interested in whisper_gpt_pipeline are comparing it to the libraries listed below
Sorting:
- ☆158Updated 2 years ago
- ☆260Updated last year
- ☆149Updated 2 years ago
- ☆62Updated last year
- openvino version of openai/whisper☆170Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆234Updated last year
- ☆39Updated last year
- Create training data for training a voice cloner for bark text to speech.☆45Updated 2 years ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated 2 years ago
- ☆83Updated last year
- ☆359Updated last year
- openai/whisper + extra features☆89Updated 2 years ago
- ☆16Updated last year
- ☆86Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Sing an idea ➡️ AI music sample🔥🎶☆116Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆246Updated 2 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- Speaker Diarization with Transformers☆69Updated last month
- ☆27Updated 2 years ago
- ☆70Updated 3 months ago
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆30Updated last year
- BIG: Back In the Game of Creative AI☆27Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆32Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 9 months ago