matallanas / whisper_gpt_pipelineLinks
A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model
☆15Updated 2 years ago
Alternatives and similar repositories for whisper_gpt_pipeline
Users that are interested in whisper_gpt_pipeline are comparing it to the libraries listed below
Sorting:
- ☆158Updated 2 years ago
- ☆62Updated last year
- ☆261Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆46Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- openvino version of openai/whisper☆172Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated 2 years ago
- ☆149Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated 10 months ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- openai/whisper + extra features☆89Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- ☆107Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆255Updated last year
- create dataset from list of youtube links easily☆21Updated 2 years ago
- ☆359Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆69Updated 2 months ago
- Speaker Diarization with Transformers☆69Updated 2 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆247Updated 2 years ago