zzasdf/VietASR

☆52

Alternatives and similar repositories for VietASR

Users who are interested in VietASR are comparing it to the libraries listed below.

ZhihaoDU / du2022sond
Speaker overlap-aware Neural Diarization
☆12 · Feb 13, 2023 · Updated 3 years ago
SpeechColab / GigaSpeechBench
☆29 · Updated this week
jeremy110 / Finetune_Nemo_ASR
Finetune Nemo parakeet ASR model with new language (support 8 bit optimizer). Experimental birwkv-fastconformer TDT for long-form ASR(8.5…
☆26 · Nov 27, 2025 · Updated 7 months ago
v-nhandt21 / ViMFA
Montreal Forced Aligner for Vietnamese
☆15 · Oct 23, 2023 · Updated 2 years ago
zhu-han / SpeechLLM
LLM-based ASR recipe with Zipformer encoder and Qwen LLM
☆34 · Sep 25, 2025 · Updated 9 months ago
BUTSpeechFIT / ASR-hybrid-decoding
☆17 · Nov 25, 2019 · Updated 6 years ago
hitz-zentroa / whisper-lm
Add n-gram and large language model (LLM) support to Whisper models.
☆43 · May 6, 2025 · Updated last year
xiquan-li / FineLAP
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36 · Apr 20, 2026 · Updated 3 months ago
Kaljurand / net-speech-api
Java API for the online speech recognition services provided by phon.ioc.ee
☆18 · Jun 4, 2021 · Updated 5 years ago
AudenAI / Auden
☆71 · Apr 2, 2026 · Updated 3 months ago
SpeechColab / GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
☆197 · Apr 28, 2026 · Updated 2 months ago
chuoibo / VocalMind
End to End Speech to Speech with Emotion System
☆15 · Feb 6, 2025 · Updated last year
tuanh123789 / Spark-TTS-finetune
finetune llm part for spark-tts model
☆126 · Mar 25, 2025 · Updated last year
the-bird-F / Expressive-Vectors
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40 · Dec 24, 2025 · Updated 7 months ago
dangtr0408 / StyleTTS2-lite
A lightweight, efficient variation of the StyleTTS 2 text-to-speech model.
☆50 · May 22, 2025 · Updated last year
khanld / chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
☆82 · Jun 9, 2026 · Updated last month
primepake / F5-TTS-meanflow-multilingual
Meanflow and multilingual for F5-TTS model
☆16 · Aug 23, 2025 · Updated 11 months ago
ASLP-lab / Smart-Glass-Challenge
☆17 · Jun 16, 2026 · Updated last month
DAMO-NLP-SG / SeaLLMs-Audio
☆53 · Dec 7, 2025 · Updated 7 months ago
facebookresearch / MMCSG
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41 · Mar 13, 2024 · Updated 2 years ago
SparkAudio / SparkVox
☆37 · Jun 9, 2025 · Updated last year
ZhikangNiu / arxiv_daily
☆22 · May 25, 2026 · Updated last month
Audio-WestlakeU / FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183 · May 7, 2026 · Updated 2 months ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆42 · Aug 11, 2023 · Updated 2 years ago
k2-fsa / text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
☆79 · Jun 30, 2025 · Updated last year
manhdh32 / 1st_kalapa_ocr
☆11 · Jan 1, 2024 · Updated 2 years ago
k2-fsa / ZipVoice
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,018 · Dec 2, 2025 · Updated 7 months ago
PyThaiNLP / thai-g2p-wiktionary-corpus
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13 · Jul 25, 2022 · Updated 3 years ago
thu-spmi / CTC-TTS
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20 · Jun 9, 2026 · Updated last month
xjuspeech / YOLOPitch
☆10 · Jun 11, 2024 · Updated 2 years ago
yxduir / m2m-70
☆18 · Jun 25, 2026 · Updated 3 weeks ago
tomer9080 / WhisperRT-Streaming
Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
☆75 · Mar 31, 2026 · Updated 3 months ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
agrija9 / Avalinguo-Audio-Set
Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification
☆13 · Aug 13, 2018 · Updated 7 years ago
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
mycrazycracy / Backends-for-SRE19
This repository will illustrate the use of some different backends on NIST SRE 2019.
☆21 · Apr 25, 2020 · Updated 6 years ago
k2-fsa / icefall
☆1,457 · Jul 16, 2026 · Updated last week
EraX-AI / viF5TTS
EraX Text to Speech base on F5-TTS Base V1
☆81 · May 8, 2025 · Updated last year
danpovey / pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92 · Aug 6, 2022 · Updated 3 years ago