andrespimartin / weighted-x-entropy-asr
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition
☆15Sep 3, 2024Updated last year
Alternatives and similar repositories for weighted-x-entropy-asr
Users that are interested in weighted-x-entropy-asr are comparing it to the libraries listed below
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- An enterprise-grade Chinese-English code-switch punctuator from funasr.☆30Apr 26, 2024Updated last year
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- VS Code Extension for Multipass☆10Sep 25, 2024Updated last year
- ☆11Sep 4, 2023Updated 2 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Sep 26, 2018Updated 7 years ago
- ☆40Apr 2, 2025Updated 10 months ago
- ☆11Dec 24, 2024Updated last year
- This is a speaker-diarization-based speech recognition API implemented using FunASR.☆17Updated this week
- The electronic Holy Quran browser Elforkane☆11Nov 14, 2021Updated 4 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆267May 19, 2024Updated last year
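- FunASR Android on-device offline version with full 2pass mode☆14Sep 4, 2023Updated 2 years ago
- For personal use: speech-to-text uses sencevoice, the LLM part is based on API calls to ollama, text-to-speech uses cosyvoice, and real-time voice input follows https://github.com/ABexit/ASR-LLM-TTS.☆12Dec 26, 2024Updated last year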
- ☆11Nov 7, 2024Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- Generalize then Adapt: Source-free Domain Adaptation for Semantic Segmentation (ICCV 2021)☆10Oct 12, 2021Updated 4 years ago
- Accompanying repository for the paper "Automatic Music Mixing Using a Generative Model of Effect Embeddings"☆21Jan 18, 2026Updated 3 weeks ago
- Official repo of the paper Deep Regression Unlearning accepted in ICML 2023☆14Jun 14, 2023Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- Speech recognition based on FunASR, including a standard version and an ONNX version (recommended).☆48Oct 12, 2024Updated last year
- ☆32Nov 18, 2025Updated 2 months ago
- ☆11Jan 22, 2017Updated 9 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Short-utterance online speech recognition service based on wenet☆11Feb 25, 2023Updated 2 years ago
- Automatically set up the AISHELL-4 and MSDWild datasets for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- FreeSWITCH ASR module forked from mod_audio_stream, using the FunASR online CPU version☆16Jun 27, 2025Updated 7 months ago
- ☆46Feb 16, 2023Updated 3 years ago
- Uyghur text resources crawled from websites☆12Dec 25, 2015Updated 10 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Data-enriching GAN for retrieving Representative Samples from a Trained Classifier☆14Sep 2, 2020Updated 5 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆12Oct 31, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- A tool for visualizing emotions in music using a Python wrapper for Spotify API. Independent post-baccalaureate research by Nick Stapleto…☆13Jun 2, 2023Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆11Aug 20, 2024Updated last year