Shelton1013 / Whisper_MCE
[ICASSP'25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics
☆26 Updated last month
Alternatives and similar repositories for Whisper_MCE
Users who are interested in Whisper_MCE compare it to the repositories listed below.
- ☆23 Updated 8 months ago
- ☆14 Updated 11 months ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data ☆31 Updated last year
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024 ☆21 Updated 6 months ago
- Prosodic Speech Segmentation with Transformers ☆25 Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs ☆62 Updated last month
- ☆13 Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels ☆37 Updated last year
- A handy dataset of noises for ASR ☆21 Updated 6 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆25 Updated 9 months ago
- WavReward: Spoken Dialogue Models With Generalist Reward Evaluators ☆36 Updated 3 weeks ago
- ☆31 Updated 11 months ago
- The project for speech translation ☆11 Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆17 Updated 8 months ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese ☆37 Updated 4 years ago
- Collection of scripts from mHuBERT-147. ☆25 Updated 6 months ago
- Speech samples and code of BEdit-TTS ☆33 Updated last year
- CTC decoder with hotwords for ASR. ☆20 Updated last month
- Balanced Error Rate for Speaker Diarization ☆32 Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆22 Updated 9 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning ☆47 Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆24 Updated last year
- Python wrapper for kaldi's arpa2fst ☆38 Updated 6 months ago
- ☆26 Updated 4 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆35 Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆23 Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆39 Updated last year
- ☆13 Updated 8 months ago
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges. ☆21 Updated this week
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆71 Updated 9 months ago