Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition
☆14Sep 3, 2024Updated last year
Alternatives and similar repositories for weighted-x-entropy-asr
Users that are interested in weighted-x-entropy-asr are comparing it to the libraries listed below
Sorting:
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆31Apr 26, 2024Updated last year
- ☆25Feb 13, 2026Updated 3 weeks ago
- VS Code Extension for Multipass☆10Sep 25, 2024Updated last year
- ☆11Sep 4, 2023Updated 2 years ago
- ☆42Apr 2, 2025Updated 11 months ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Sep 26, 2018Updated 7 years ago
- ☆11Dec 24, 2024Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆269May 19, 2024Updated last year
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- FunASR安卓端侧离线版本2pass全模式☆14Sep 4, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆48Oct 12, 2024Updated last year
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- ☆46Feb 16, 2023Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- Generalize then Adapt: Source-free Domain Adaptation for Semantic Segmentation (ICCV 2021)☆10Oct 12, 2021Updated 4 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 8 months ago
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆12Dec 26, 2024Updated last year
- Embedded Tajweed annotation for the Qur'an☆11Nov 30, 2025Updated 3 months ago
- ☆11Jan 22, 2017Updated 9 years ago
- ☆11Nov 7, 2024Updated last year
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 4 months ago
- Data-enriching GAN for retrieving Representative Samples from aTrained Classifier☆14Sep 2, 2020Updated 5 years ago
- ☆14Aug 9, 2021Updated 4 years ago
- state-of-the-art models for diacritics restoration for Arabic language☆17Feb 23, 2025Updated last year
- ☆16Sep 19, 2023Updated 2 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 5 months ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- Deep Neural Networks for audio classification☆11Apr 11, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Overlap-and-add convolution in Python aimed at applying reverberation in music and audio signals☆12Sep 12, 2017Updated 8 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆11Aug 20, 2024Updated last year