XiaomiMiMo/MiMo-V2.5-ASR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XiaomiMiMo/MiMo-V2.5-ASR)

XiaomiMiMo / MiMo-V2.5-ASR

☆73

Alternatives and similar repositories for MiMo-V2.5-ASR

Users that are interested in MiMo-V2.5-ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pengzhendong / audiolab
View on GitHub
A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)
☆38Mar 31, 2026Updated 3 weeks ago
wenet-e2e / west
View on GitHub
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
☆206Apr 7, 2026Updated 3 weeks ago
yangdongchao / Omni-AutoThink
View on GitHub
Adaptive Multimodal Reasoning via Reinforcement Learning
☆23Jan 11, 2026Updated 3 months ago
vivian556123 / NeurIPS2024-CoVoMix
View on GitHub
Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
☆66Jan 16, 2025Updated last year
snsun / LSTM_PIT
View on GitHub
☆10Sep 18, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
bookbot-hive / k2-indonesian-asr
View on GitHub
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆15Jun 30, 2023Updated 2 years ago
usnistgov / F4DE
View on GitHub
Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…
☆25Jul 6, 2017Updated 8 years ago
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆15Mar 26, 2025Updated last year
ASLP-lab / WenetSpeech-Chuan
View on GitHub
Official repository for the WenetSpeech-Chuan dataset.
☆176Feb 5, 2026Updated 2 months ago
Mddct / transformer-vocos
View on GitHub
☆36Sep 6, 2025Updated 7 months ago
xingchensong / TouchNet
View on GitHub
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
☆230Apr 8, 2026Updated 3 weeks ago
danpovey / conditional-flow-matching
View on GitHub
☆30Aug 8, 2024Updated last year
yangdongchao / ALMTokenizer
View on GitHub
The demo page for ALMTokenizer
☆59Apr 14, 2025Updated last year
wangdong99 / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆27Jun 13, 2016Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Soul-AILab / SAC
View on GitHub
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆101Nov 1, 2025Updated 5 months ago
amirharati / kaldi-alligner
View on GitHub
scripts to align a given wave to its transcription using trained models by Kaldi
☆36Aug 15, 2019Updated 6 years ago
nwpuaslp / TTS_Course
View on GitHub
☆70Nov 30, 2020Updated 5 years ago
zxs731 / raspbarry_qwen2.5_omni
View on GitHub
树莓派qwen-omni语音助手免TTS/STT
☆16Apr 4, 2025Updated last year
nafiuny / ICRCycleGAN-VC
View on GitHub
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆15Apr 15, 2026Updated 2 weeks ago
nonverbalspeech38k / nonverspeech38k
View on GitHub
The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…
☆65Dec 26, 2025Updated 4 months ago
cuhealthybrains / MT-LLM
View on GitHub
The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"
☆50Apr 7, 2025Updated last year
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
azmat21 / UyghurTextResource
View on GitHub
uyghur text resource crawled from website
☆12Dec 25, 2015Updated 10 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
zxzhao0 / C2SER
View on GitHub
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆47Mar 3, 2025Updated last year
halsay / ASR-TTS-paper-daily
View on GitHub
Update ASR paper everyday
☆506Updated this week
lmxue / Audio-FLAN
View on GitHub
Audio-FLAN
☆160Sep 23, 2025Updated 7 months ago
ASLP-lab / WenetSpeech-Wu-Repo
View on GitHub
A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations
☆143Feb 6, 2026Updated 2 months ago
01Zhangbw / Speech-and-audio-papers-Top-Conference
View on GitHub
☆139Jan 24, 2026Updated 3 months ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
lucadellalib / ts-asr
View on GitHub
Target speaker automatic speech recognition (TS-ASR)
☆13Oct 14, 2023Updated 2 years ago
XiaomiMiMo / MiMo-Audio-Training
View on GitHub
☆104Oct 16, 2025Updated 6 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
VKW2021 / kaldi-baseline
View on GitHub
kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
Soul-AILab / SoulX-Singer-Eval
View on GitHub
A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis
☆26Feb 11, 2026Updated 2 months ago
Takaaki-Saeki / DiscreteSpeechMetrics
View on GitHub
Reference-aware automatic speech evaluation toolkit
☆181Dec 5, 2024Updated last year
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 6 years ago
sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
BUTSpeechFIT / hystoc
View on GitHub
Getting confidences from any end-to-end systems
☆11May 24, 2023Updated 2 years ago