FinvDialect / 2023_finvcup_baseline
☆17Updated last year
Related projects
Alternatives and complementary repositories for 2023_finvcup_baseline
- End-to-end speech recognition on AISHELL dataset.☆30Updated 3 years ago
- ASR papers and ASR tools☆38Updated last year
- Materials compiled mainly from Professor Hung-yi Lee's 2020 human language processing course, including code and slides☆33Updated 3 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆10Updated last year
- Reproduction code for Qwen-Audio fine-tuning☆23Updated 3 months ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆26Updated 8 months ago
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆37Updated 4 months ago
- ☆15Updated 2 years ago
- SafeEar: Content Privacy-Preserving Audio Deepfake Detection (Accepted by CCS 2024)☆45Updated this week
- ☆26Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- Case study and multilingual performance of an ICASSP submission☆19Updated 2 years ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆32Updated last month
- A separately maintained Chinese TTS☆35Updated 2 years ago
- Speech samples and code of BEdit-TTS☆32Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆37Updated 3 weeks ago
- Speech recognition: papers and cutting-edge research☆43Updated 2 years ago
- Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…☆63Updated 2 years ago
- ☆29Updated 5 years ago
- Official release of StyleTalk dataset.☆57Updated 4 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- A librosa-style STFT/Fbank/MFCC feature extraction implementation written in PyTorch using 1D convolutions.☆73Updated 2 years ago
- A repository for Chinese text normalization.☆14Updated 3 years ago
- End-to-end Speech Translation☆36Updated 3 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Updated 2 years ago