FunAudioLLM/FunAudioLLM-APP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FunAudioLLM/FunAudioLLM-APP)

FunAudioLLM / FunAudioLLM-APP

☆384

Alternatives and similar repositories for FunAudioLLM-APP

Users that are interested in FunAudioLLM-APP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FunAudioLLM / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,888Updated this week
FunAudioLLM / FunAudioLLM.github.io
View on GitHub
☆58Updated this week
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,265May 25, 2026Updated last month
fun-audio-llm / fun-audio-llm.github.io
View on GitHub
FunAudioLLM homepage
☆17Dec 11, 2024Updated last year
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,333Updated this week
Eric0308 / xiaozhi-client
View on GitHub
这是一个用于连接小智AI服务的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。
☆27Mar 14, 2025Updated last year
FunAudioLLM / FunResearch
View on GitHub
This repository is maintained by the Speech Team at Alibaba’s Tongyi Lab, serving as an open-source platform for our cutting-edge researc…
☆35Jun 2, 2026Updated last month
lovemefan / SenseVoice-python
View on GitHub
SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime
☆114Jun 12, 2026Updated last month
QwenLM / Qwen2-Audio
View on GitHub
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
☆2,088Apr 21, 2025Updated last year
FunAudioLLM / FunMusic
View on GitHub
A fundamental toolkit designed for music, song, and audio generation
☆1,369May 20, 2025Updated last year
RapidAI / RapidASR
View on GitHub
📣 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …
☆608May 15, 2024Updated 2 years ago
xinchen-ai / Westlake-Omni
View on GitHub
☆203Sep 24, 2024Updated last year
pengzhendong / speaker-diarization
View on GitHub
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15Dec 23, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lovemefan / SenseVoice.cpp
View on GitHub
Port of Funasr's Sense-voice model in C/C++
☆567Dec 19, 2025Updated 7 months ago
v3ucn / ASR_TOOLS_SenseVoice_WebUI
View on GitHub
Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型
☆182Jul 10, 2024Updated 2 years ago
gpt-omni / mini-omni
View on GitHub
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…
☆3,562Nov 5, 2024Updated last year
zai-org / GLM-4-Voice
View on GitHub
GLM-4-Voice | 端到端中英语音对话模型
☆3,204Dec 5, 2024Updated last year
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
modelscope / FunClip
View on GitHub
FunASR-powered video transcription, subtitle generation, and LLM-assisted clipping tool with a local Gradio UI.
☆5,990Updated this week
pengzhendong / streaming-sensevoice
View on GitHub
Pseudo Streaming SenseVoice with Hotwords
☆465Jun 15, 2026Updated last month
stepfun-ai / Step-Audio
View on GitHub
☆32Mar 16, 2026Updated 4 months ago
lovemefan / paraformer-python
View on GitHub
paraformer(chinense asr) online onnx runtime for python
☆54Mar 27, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,651Apr 10, 2026Updated 3 months ago
Alittleegg / Eureka-Audio
View on GitHub
Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…
☆40Apr 11, 2026Updated 3 months ago
v3ucn / CosyVoice_For_Windows
View on GitHub
CosyVoice在Windows环境下使用的版本
☆767Nov 19, 2024Updated last year
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,327Jun 9, 2026Updated last month
calmstate / Itinerant
View on GitHub
A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.
☆18Aug 30, 2024Updated last year
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
lhl / voicechat2
View on GitHub
Local SRT/LLM/TTS Voicechat
☆775Oct 12, 2024Updated last year
QwenLM / Qwen-Audio
View on GitHub
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
☆1,913Jul 5, 2024Updated 2 years ago
modelscope / 3D-Speaker
View on GitHub
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
☆3,055Dec 8, 2025Updated 7 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
jzshq208886 / wenet_asr
View on GitHub
☆12Jul 11, 2024Updated 2 years ago
HG-ha / SenseVoice-Api
View on GitHub
阿里SenseVoice的fastpi封装，采用onnx发布，体积更小，附带量化模型，支持GPU。支持从URL文件进行语音识别。
☆112Sep 2, 2024Updated last year
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
ddlBoJack / Awesome-Speech-Language-Model
View on GitHub
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
☆201Jun 7, 2026Updated last month
NN-Project-2 / Emotion-TTS-Emebddings
View on GitHub
This project explores zero-shot emotional speech synthesis using EMOD, a novel approach combining emotion and content embeddings for mult…
☆18Jun 26, 2026Updated 3 weeks ago
78 / esp-opus-encoder
View on GitHub
ESP32 OPUS Encoder wrapper
☆16Aug 9, 2025Updated 11 months ago