将 Qwen3-ASR 的 LLM 部分导出为 GGUF,用 llama.cpp 进行加速推理。后者支持 Vulkan 和 Cuda 加速。
☆172Apr 29, 2026Updated last month
Alternatives and similar repositories for Qwen3-ASR-GGUF
Users that are interested in Qwen3-ASR-GGUF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆101May 26, 2026Updated 2 weeks ago
- A lightweight demo of FunASR-Nano using ONNX runtime.☆82Feb 25, 2026Updated 3 months ago
- Implementation of Qwen3-ASR-0.6B in GGML☆93Feb 10, 2026Updated 4 months ago
- Transcribe subtitles and translate them offline with ease.☆45Jan 10, 2026Updated 5 months ago
- shortwave reception software☆14Jul 17, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 让Claude Code不再是黑箱☆40Apr 17, 2026Updated last month
- Implementation of StyleTTS for Mandarin☆11Jun 22, 2023Updated 2 years ago
- Code for "Speaker Clustering using Dominant Sets", ICPR 2018☆11Nov 28, 2020Updated 5 years ago
- The app exhibition hall for the development of vis-three and its derivatives☆12Dec 9, 2023Updated 2 years ago
- 一个基于MTranServer 的 bob 翻译插件,让你告别用量焦虑和速度焦虑。☆49Apr 2, 2026Updated 2 months ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Pytorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated 2 years ago
- Code for calculate DNS_MOS.☆43Dec 18, 2022Updated 3 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- RO-SCIRAW框架是由Kirk Lin开创的提示词方法论,为构建高度精确和高效的提示词提供了一个全新的范式。☆17Jul 29, 2024Updated last year
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 3 years ago
- Lean neural real-time acoustic echo cancellation with soft delay estimation - GGML and PyTorch inference☆104May 29, 2026Updated 2 weeks ago
- F5-TTS 推理加速,速度提升约4倍!☆124Jan 6, 2025Updated last year
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- 基于cloudflare的直链加速下载服务☆11Jul 6, 2023Updated 2 years ago
- 日语语音识别(ASR)模型☆33Jun 6, 2026Updated last week
- go-ocr 是一款基于 Golang + ONNX 构建的 OCR 工具库,专注于为 Go 生态提供简单易用、可扩展的文字识别能力。☆61Jan 26, 2026Updated 4 months ago
- ☆10May 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- Repo for our pooling approach on the DCASE2018 task4☆16Jul 6, 2023Updated 2 years ago
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation☆36Aug 5, 2025Updated 10 months ago
- An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…☆14Dec 27, 2022Updated 3 years ago
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15May 20, 2026Updated 3 weeks ago
- An out-of-the-box animation visualization editor | 一款开箱即用的动画可视化编辑器☆23May 12, 2025Updated last year
- Utilizes ONNX Runtime to transcribe audio into text.☆85Jun 6, 2026Updated last week
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆13Nov 29, 2022Updated 3 years ago
- End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on…☆1,267Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 前端抠图-canvas实现☆13May 4, 2020Updated 6 years ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆54Oct 12, 2024Updated last year
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- Rockchip's libRGA source unofficial mirror☆14Feb 6, 2026Updated 4 months ago
- 用多层BLSTM模型同时进行中文分词和标点符号预测☆18Nov 8, 2024Updated last year
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Jul 4, 2022Updated 3 years ago
- A Unified/Remark plugin that injects a DOCX compiler using [`mdast2docx`](https://github.com/tiny-md/mdast2docx) and outputs `.docx` file…☆15Updated this week