alphacep / whisper-prompts
OpenAI Whisper Prompt Examples
☆39Updated last year
Related projects ⓘ
Alternatives and complementary repositories for whisper-prompts
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆132Updated 9 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆98Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 8 months ago
- Various speech datasets made available to the public☆98Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆70Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- ☆51Updated last week
- Reproducible experimental protocols for multimedia (audio, video, text) database☆83Updated 3 weeks ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆72Updated last year
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- asr2k☆48Updated 5 months ago
- ☆19Updated 6 years ago
- ☆64Updated last year
- Zero-shot Audio Classification using Whisper☆74Updated last year
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆62Updated 8 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- ☆40Updated last year
- ☆38Updated 2 years ago
- ☆31Updated 2 months ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆41Updated 3 years ago
- ☆33Updated 3 years ago
- ☆16Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT☆37Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Simple Diarization model☆42Updated 11 months ago
- A python package for deep multilingual punctuation prediction.☆94Updated 2 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆58Updated 4 months ago
- Collection of scripts from mHuBERT-147.☆22Updated 4 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆134Updated this week