apple / ml-code-switched-speech-translation
This repository contains the code and instructions needed to reproduce the dataset splits for out paper "Speech Translation for Code-Switched Speech".
☆29Updated 2 years ago
Alternatives and similar repositories for ml-code-switched-speech-translation:
Users that are interested in ml-code-switched-speech-translation are comparing it to the libraries listed below
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models.☆76Updated last year
- ☆56Updated 2 years ago
- Official code for Wav2Seq☆96Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- ☆42Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆31Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆20Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆23Updated 5 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 6 months ago
- ☆11Updated 3 years ago
- ☆35Updated 3 months ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- multilingual speech aligner☆73Updated last year
- ☆23Updated last year
- asr2k☆48Updated 7 months ago
- ☆37Updated 3 years ago
- Collection of scripts from mHuBERT-147.☆23Updated 2 months ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆49Updated 2 years ago
- ☆84Updated 9 months ago
- Acoustic Neighbor Embeddings☆18Updated last month
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 8 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 2 years ago
- Training code and trained checkpoints for ASGAN.☆62Updated last year
- Temporary anonymous version☆22Updated 9 months ago
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 3 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆48Updated last year