kaihuhuang/Language-Group

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kaihuhuang/Language-Group)

kaihuhuang / Language-Group

☆11

Alternatives and similar repositories for Language-Group

Users that are interested in Language-Group are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangers / subtools2
View on GitHub
egrecho project
☆11Apr 30, 2026Updated 2 months ago
gwh22 / LAFMA
View on GitHub
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)
☆44Jun 13, 2024Updated 2 years ago
lzhxmu / AccDiffusion_v2
View on GitHub
Code release for AccDiffusionV2 (TPAMI)
☆34Nov 4, 2025Updated 8 months ago
PunkMale / OR-Gate
View on GitHub
Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.
☆12Oct 23, 2023Updated 2 years ago
liduojia1 / MeanFlowSE
View on GitHub
☆43Jan 26, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
Hannieliao / Baton
View on GitHub
Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"
☆32Mar 4, 2025Updated last year
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
nikvaessen / w2v2-speaker-few-samples
View on GitHub
Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688
☆13Dec 2, 2024Updated last year
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
lzhxmu / VTW
View on GitHub
Code release for VTW (AAAI 2025 Oral)
☆68Nov 4, 2025Updated 8 months ago
SXU-YaxinGuo / CRMU
View on GitHub
儿童故事常识推理与寓意理解评测（Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories，CRMU）
☆18Oct 22, 2024Updated last year
VincentHancoder / ViGoR-Bench-Eval
View on GitHub
☆34Apr 5, 2026Updated 3 months ago
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
FreedomIntelligence / S2S-Arena
View on GitHub
☆21Jun 4, 2026Updated last month
zyxxmu / LBC
View on GitHub
Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity
☆22Jan 13, 2023Updated 3 years ago
k2-fsa / sherpa-mlx
View on GitHub
sherpa with mlx
☆15Aug 2, 2025Updated 11 months ago
jzshq208886 / wenet_asr
View on GitHub
☆12Jul 11, 2024Updated 2 years ago
gengxuelong / wenet_LLM_from_ASLP
View on GitHub
wenet_LLM_from_ASLP
☆15Nov 26, 2024Updated last year
VoxBlink / ScriptsForVoxBlink
View on GitHub
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆30Apr 16, 2024Updated 2 years ago
Bartelds / ctc-dro
View on GitHub
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17May 16, 2025Updated last year
TaoRuijie / Loss-Gated-Learning
View on GitHub
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆92May 29, 2023Updated 3 years ago
HuangXiZhou / auto-build
View on GitHub
🤖 Node.js auto deploy demo based on GitHub Webhook
☆29Oct 23, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
kuan2jiu99 / Awesome-Speech-Generation
View on GitHub
Survey on speech generation work.
☆21Nov 26, 2023Updated 2 years ago
pxiangwu / TopoFilter
View on GitHub
NeurIPS 2020, "A Topological Filter for Learning with Label Noise".
☆31Apr 11, 2025Updated last year
ZhikangNiu / Semantic-VAE
View on GitHub
[INTERSPEECH 2026 Oral]Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
☆120Jun 21, 2026Updated last month
kyegomez / MELLE
View on GitHub
An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"
☆16Updated this week
kwatcharasupat / divide-and-remaster-v3
View on GitHub
Landing Page for Divide and Remaster v3
☆26Jul 29, 2025Updated 11 months ago
YanZiBuGuiCHunShiWan / RESTFUL_ASR
View on GitHub
基于wenet的短时在线语音识别服务
☆11Feb 25, 2023Updated 3 years ago
IS2AI / MultilingualASR
View on GitHub
☆14Aug 9, 2021Updated 4 years ago
Gelelmaster / Funasr-Qwen-GPTSovits
View on GitHub
<综合> Funasr语音识别，调用Qwen大模型回答，通过GPTSovits输出语音的ai程序，其中调用模型还是在线，后续将添加离线大模型
☆13Nov 30, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lukeewin / ASR_LLM_TTS_Front
View on GitHub
ASR_LLM_TTS前端项目
☆15Dec 3, 2024Updated last year
ali-vilab / CAPability
View on GitHub
What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
☆28May 16, 2025Updated last year
tmllab / 2021_NeurIPS_PES
View on GitHub
☆30Jan 7, 2023Updated 3 years ago
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
qinxiaoyi / Cross-Age_Speaker_Verification
View on GitHub
☆31Aug 28, 2022Updated 3 years ago
AI-confused / arxiv_auto_crawler
View on GitHub
auto scrawl for arrive data
☆16Jan 24, 2022Updated 4 years ago