matallanas / whisper_gpt_pipeline
A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model
☆15Updated 2 years ago
Alternatives and similar repositories for whisper_gpt_pipeline:
Users that are interested in whisper_gpt_pipeline are comparing it to the libraries listed below
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- ☆153Updated last year
- ☆62Updated 6 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated last year
- Speaker Diarization with Transformers☆64Updated 8 months ago
- Sing an idea ➡️ AI music sample🔥🎶☆96Updated 9 months ago
- ☆38Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last week
- Code for OpenAI Whisper Web App Demo☆94Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 7 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆50Updated last year
- ☆147Updated last year
- A testing repo to share code and thoughts on diarisation☆53Updated 10 months ago
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- BIG: Back In the Game of Creative AI☆26Updated last year
- Examples of apps built with Nendo, the AI Audio Tool Suite☆56Updated 11 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆114Updated 2 weeks ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated last year
- The demo page of UniAudio☆34Updated 11 months ago
- A simple voice conversion tool☆17Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆63Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆28Updated last year
- text-to-audio-latent-diffusion☆37Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆45Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆86Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆27Updated last year