kaihuhuang / Language-Group
☆9Updated last month
Alternatives and similar repositories for Language-Group:
Users that are interested in Language-Group are comparing it to the libraries listed below
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…☆16Updated 10 months ago
- ☆144Updated 2 years ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆239Updated 2 weeks ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆47Updated last year
- UT-Sarulab MOS prediction system using SSL models☆202Updated 9 months ago
- ☆55Updated 8 months ago
- ☆26Updated 2 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆37Updated last year
- ☆76Updated 5 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 7 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆75Updated 8 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆24Updated 9 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆138Updated last month
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆75Updated 2 years ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆107Updated last year
- ☆11Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆21Updated last month
- Target Speaker Extraction Toolkit☆141Updated this week
- ☆43Updated last year
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆184Updated 9 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆54Updated 4 months ago
- ☆32Updated 3 years ago
- ☆11Updated last month
- Reference-aware automatic speech evaluation toolkit☆140Updated last month
- It's a repository for implementations of neural speech editing algorithms.☆193Updated last year
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".☆47Updated last month
- ☆149Updated 6 months ago
- Recipe for LibriPhrase☆27Updated last year
- Retrieval-Augmented MOS Prediction with Prior Knowledge Integration☆17Updated last month
- ☆13Updated 2 years ago