FinvDialect / 2023_finvcup_baselineLinks
☆17Updated last year
Alternatives and similar repositories for 2023_finvcup_baseline
Users that are interested in 2023_finvcup_baseline are comparing it to the libraries listed below
Sorting:
- The repoduction codes for Qwen-Audio Fine-tuning☆39Updated 9 months ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Papers of ASR, Tools of ASR☆40Updated 3 months ago
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆39Updated 10 months ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Updated last year
- ☆29Updated 5 years ago
- 主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt☆35Updated 4 years ago
- ☆25Updated 2 years ago
- Official release of StyleTalk dataset.☆64Updated 11 months ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated last week
- 2024 FinVolution Global Data Science Competition-9th baseline☆18Updated last year
- A Neural Audio Codec (NAC) for Universal Audio☆36Updated last week
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆71Updated 7 months ago
- End-to-end speech recognition on AISHELL dataset.☆32Updated 3 years ago
- magicspeech competition recipe☆18Updated 4 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆31Updated last year
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆29Updated last year
- faster inference☆28Updated 4 months ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆73Updated last year
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆75Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆28Updated last year
- ☆36Updated 2 years ago
- A repository for Chinese text normalization.☆15Updated 4 years ago
- ☆15Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆47Updated last year
- Huawei Grad-TTS for Chinese☆50Updated last year