matallanas / whisper_gpt_pipeline
A Hugging Face pipeline to train a GPT model on the transcripts obtained by the OpenAI Whisper model
☆15Updated 2 years ago
Alternatives and similar repositories for whisper_gpt_pipeline
Users that are interested in whisper_gpt_pipeline are comparing it to the libraries listed below
- ☆62Updated last year
- ☆261Updated last year
- ☆157Updated 2 years ago
- ☆147Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆47Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- BIG: Back In the Game of Creative AI☆27Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- openai/whisper + extra features☆89Updated 2 years ago
- ☆39Updated last year
- A list of scripts/notebooks I'd like to keep handy☆18Updated last year
- ☆106Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project☆232Updated last year
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆19Updated 2 years ago
- Sing an idea ➡️ AI music sample🔥🎶☆117Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 7 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- Frontend UI and Backend Server for Stable Diffusion models☆31Updated 2 years ago
- openvino version of openai/whisper☆176Updated last year
- Speaker Diarization with Transformers☆69Updated 4 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆194Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- generate granular word-level captions in srt format☆57Updated 3 years ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆149Updated last year