matallanas / whisper_gpt_pipelineLinks
A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model
☆15Updated 2 years ago
Alternatives and similar repositories for whisper_gpt_pipeline
Users that are interested in whisper_gpt_pipeline are comparing it to the libraries listed below
Sorting:
- ☆157Updated last year
- ☆62Updated 11 months ago
- BIG: Back In the Game of Creative AI☆27Updated 2 years ago
- create dataset from list of youtube links easily☆19Updated 2 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- BSRGAN-Pip: Packaged version of the BSRGAN repository☆14Updated 2 years ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆39Updated last year
- Speaker Diarization with Transformers☆67Updated 2 weeks ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago
- ☆16Updated last year
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- Create training data for training a voice cloner for bark text to speech.☆45Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- ☆359Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 8 months ago
- openai/whisper + extra features☆89Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- ☆27Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 3 months ago