yuekaizhang / Fun-ASR-vllmView external linksLinks
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
☆71Jan 14, 2026Updated last month
Alternatives and similar repositories for Fun-ASR-vllm
Users that are interested in Fun-ASR-vllm are comparing it to the libraries listed below
Sorting:
- Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆62Feb 7, 2026Updated last week
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated last week
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Official code for "Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis"☆56Feb 3, 2026Updated last week
- Official Implementation of GLAP - General Language Audio Pretraining☆61Jan 5, 2026Updated last month
- MOSS-Speech is a true speech-to-speech large language model without text guidance.☆122Dec 4, 2025Updated 2 months ago
- A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation☆271Feb 5, 2026Updated last week
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆884Feb 2, 2026Updated last week
- mnn asr demo.☆25Mar 24, 2025Updated 10 months ago
- low-latency realtime ASR based on FireRedASR☆57Jul 8, 2025Updated 7 months ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆224Aug 6, 2025Updated 6 months ago
- 基于ONNXRuntime以及LLama.cpp推理引擎实现的高性能C++语音推理框架,在性能极差的边缘设备上都能做到RTF<0.7实时对话。☆39Dec 23, 2025Updated last month
- ☆82Dec 31, 2025Updated last month
- In-car multi-channel speech transcription system of AISHELL-5.☆40Jun 9, 2025Updated 8 months ago
- ☆68Dec 30, 2025Updated last month
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆24Oct 11, 2024Updated last year
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- Llasa Speed Up☆57Jan 18, 2026Updated 3 weeks ago
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems☆75Jan 25, 2026Updated 3 weeks ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- 使用vllm加速cosyvoice2的推理☆482Apr 26, 2025Updated 9 months ago
- A lightweight Chinese/Cantonese to Pinyin library.☆42May 31, 2025Updated 8 months ago
- Utilizes ONNX Runtime for speech activity detection.☆41Dec 10, 2025Updated 2 months ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- Speech Emotion Recognition using Deep Learning☆12May 24, 2021Updated 4 years ago
- X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech…☆176Updated this week
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆838Dec 2, 2025Updated 2 months ago
- ☆14Jun 10, 2025Updated 8 months ago
- Brand new TTS solution☆11Dec 7, 2024Updated last year
- 提供了一个极简的发电文案接口和一些云崽插件☆11Jan 17, 2025Updated last year
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆22Jan 4, 2026Updated last month
- LangGraph Mastery Playbook: guided, code-first lessons for building memory-aware LLM agents and workflows with LangGraph, TrustCall, and …☆40Nov 4, 2025Updated 3 months ago
- CV approach aimed to remove moving objects in videos (dynamic and static camera)☆11Mar 21, 2021Updated 4 years ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- 一个网页解谜游戏框架,支持制作/载入关卡包,支持多种关卡类型☆14Aug 18, 2014Updated 11 years ago
- Containerized self-hosted REST API for vision classification, utilizing Hugging Face transformers.☆10Dec 5, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year