cisnlp / MaskLID

💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024

☆10

Alternatives and similar repositories for MaskLID

Users who are interested in MaskLID are comparing it to the repositories listed below.

30stomercury / hmm-backprop
Fast and differentiable hidden Markov model in C++
☆17 Updated 2 years ago
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13 Updated last year
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆18 Updated 3 weeks ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆13 Updated 2 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with PyTorch [kaldi]
☆12 Updated 5 years ago
sushant-t / tts-trainer
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆29 Updated 2 years ago
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27 Updated last year
revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49 Updated 2 years ago
y-chan / hifi-gan-misrnet
Unofficial PyTorch implementation of HiFi-GAN with fast MISR.
☆15 Updated 2 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 Updated 11 months ago
atosystem / SSL_Interface
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆15 Updated 8 months ago
slp-rl / salmon
The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)
☆46 Updated 3 months ago
utter-project / mHuBERT-147-scripts
Collection of scripts from mHuBERT-147.
☆29 Updated 8 months ago
ShovalMessica / NAST
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…
☆46 Updated last year
frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13 Updated 2 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 Updated 3 years ago
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13 Updated 2 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27 Updated last year
Prem-kumar27 / Fast-KTSpeechCrawler
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆24 Updated 4 years ago
zjlww / dsp
Digital Speech Processing in PyTorch.
☆14 Updated 2 years ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22 Updated last year
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆14 Updated 7 months ago
ufal / SimulStreaming
☆31 Updated last week
SonyResearch / diffvox
Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
☆28 Updated this week
amazon-science / iwslt-autodub-task
☆20 Updated last year
voidful / vall-e-encodec
☆41 Updated 2 years ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18 Updated last year
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆25 Updated last year
D-Keqi / LS-Transducer-SST
☆11 Updated last year
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 Updated last year