andrespimartin/weighted-x-entropy-asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/andrespimartin/weighted-x-entropy-asr)

andrespimartin / weighted-x-entropy-asr

Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition

☆14

Alternatives and similar repositories for weighted-x-entropy-asr

Users that are interested in weighted-x-entropy-asr are comparing it to the libraries listed below

Sorting:

YUCHEN005 / DPSL-ASR
View on GitHub
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆43May 23, 2023Updated 2 years ago
jiay7 / wenet_onlinedecode
View on GitHub
Went online decode demo
☆31Apr 28, 2021Updated 4 years ago
lovemefan / CT-Transformer-punctuation
View on GitHub
A enterprise-grade Chinese-English code switch punctuator from funasr.
☆31Apr 26, 2024Updated last year
ZoeLong98 / bento_portofolio_template
View on GitHub
☆25Feb 13, 2026Updated 3 weeks ago
levalleyjack / multipass-manager-vscode
View on GitHub
VS Code Extension for Multipass
☆10Sep 25, 2024Updated last year
jagabandhumishra / W2V-E2E-Language-Diarization
View on GitHub
☆11Sep 4, 2023Updated 2 years ago
NKU-HLT / AudioEditor
View on GitHub
☆42Apr 2, 2025Updated 11 months ago
moisesveleta / GOP-LSTM
View on GitHub
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32Sep 26, 2018Updated 7 years ago
kaihuhuang / Language-Group
View on GitHub
☆11Dec 24, 2024Updated last year
NKU-HLT / KNN-CTC
View on GitHub
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆42Mar 20, 2024Updated last year
Srijith-rkr / Whispering-LLaMA
View on GitHub
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
☆269May 19, 2024Updated last year
Speech-Lab-IITM / Hindi-ASR-Challenge
View on GitHub
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆11Nov 5, 2020Updated 5 years ago
SoonSYJ / fawasr
View on GitHub
FunASR安卓端侧离线版本2pass全模式
☆14Sep 4, 2023Updated 2 years ago
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
peilongchencc / My-FunASR
View on GitHub
基于FunASR实现语音识别，包含常规版和ONNX版(推荐)。
☆48Oct 12, 2024Updated last year
azmat21 / UyghurTextResource
View on GitHub
uyghur text resource crawled from website
☆12Dec 25, 2015Updated 10 years ago
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆12Mar 15, 2025Updated 11 months ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆11Dec 15, 2022Updated 3 years ago
sinhat98 / adapter-wavlm
View on GitHub
☆46Feb 16, 2023Updated 3 years ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
YanZiBuGuiCHunShiWan / RESTFUL_ASR
View on GitHub
基于wenet的短时在线语音识别服务
☆11Feb 25, 2023Updated 3 years ago
val-iisc / SFDA-Seg
View on GitHub
Generalize then Adapt: Source-free Domain Adaptation for Semantic Segmentation (ICCV 2021)
☆10Oct 12, 2021Updated 4 years ago
zhoutuan / mod_funasr
View on GitHub
FreeSWITCH ASR module fork from mod_audio_stream， use FunASR online cpu version
☆16Jun 27, 2025Updated 8 months ago
henttttai / voice-to-voice-llm-structure
View on GitHub
自用，语音到文本用的sencevoice，llm部分基于ollama的API调用，文本到语音用的cosyvoice，实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。
☆12Dec 26, 2024Updated last year
tarekeldeeb / quran-tajweed-embedded
View on GitHub
Embedded Tajweed annotation for the Qur'an
☆11Nov 30, 2025Updated 3 months ago
carlobar / BDT_latex
View on GitHub
☆11Jan 22, 2017Updated 9 years ago
uthree / ddsp-vocoder
View on GitHub
☆11Nov 7, 2024Updated last year
FrenchKrab / datasets-pyannote
View on GitHub
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15Oct 22, 2025Updated 4 months ago
vcl-iisc / DeGAN
View on GitHub
Data-enriching GAN for retrieving Representative Samples from aTrained Classifier
☆14Sep 2, 2020Updated 5 years ago
IS2AI / MultilingualASR
View on GitHub
☆14Aug 9, 2021Updated 4 years ago
mush42 / hareef
View on GitHub
state-of-the-art models for diacritics restoration for Arabic language
☆17Feb 23, 2025Updated last year
mathieulagrange / ddspMusicBandwidthExtension
View on GitHub
☆16Sep 19, 2023Updated 2 years ago
MuSAELab / AUDDT
View on GitHub
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆29Oct 9, 2025Updated 5 months ago
agrija9 / Avalinguo-Audio-Set
View on GitHub
Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification
☆13Aug 13, 2018Updated 7 years ago
lvrysis / Audio-DNN-Classification
View on GitHub
Deep Neural Networks for audio classification
☆11Apr 11, 2024Updated last year
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Mar 26, 2022Updated 3 years ago
tiagoft / overlap-and-add
View on GitHub
Overlap-and-add convolution in Python aimed at applying reverberation in music and audio signals
☆12Sep 12, 2017Updated 8 years ago
PD-Mera / ctranslate2-triton-backend
View on GitHub
Triton backend for https://github.com/OpenNMT/CTranslate2
☆11Aug 20, 2024Updated last year