may- / joeys2t
Minimalist Speech-to-Text toolkit for educational purposes
☆12Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for joeys2t
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- asr2k☆48Updated 5 months ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆42Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆8Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated last year
- ☆33Updated 3 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆15Updated 2 weeks ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- Train a fiwGAN or ciwGAN model using your own training data☆13Updated 2 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆23Updated last year
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆23Updated 2 weeks ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆30Updated last year
- Dataset Release for Intent Classification from Speech☆45Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 8 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆42Updated 5 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆21Updated this week
- Speech in Flax/JAX☆15Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆18Updated last year
- ☆16Updated 5 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆18Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- ☆17Updated last year