apple / ml-code-switched-speech-translation
This repository contains the code and instructions needed to reproduce the dataset splits for out paper "Speech Translation for Code-Switched Speech".
☆29Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ml-code-switched-speech-translation
- ☆84Updated 7 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆75Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated last year
- ☆19Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆21Updated 3 months ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆26Updated last year
- Official code for Wav2Seq☆95Updated 2 years ago
- ☆24Updated 3 weeks ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆20Updated 3 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- Taiwanese Speech Synthesis with Tacotron2☆18Updated 2 years ago
- ☆12Updated last year
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- asr2k☆48Updated 5 months ago
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- Temporary anonymous version☆22Updated 8 months ago
- Collection of scripts from mHuBERT-147.☆22Updated this week
- Implementation of Google's USM speech model in Pytorch☆25Updated last week
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated last year
- ☆41Updated 2 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- ☆33Updated 3 years ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆49Updated 2 years ago
- multilingual speech aligner☆72Updated last year
- ☆54Updated this week
- Training code and trained checkpoints for ASGAN.☆60Updated 10 months ago
- ☆32Updated 2 months ago
- ☆42Updated 2 years ago