基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。
☆48Oct 12, 2024Updated last year
Alternatives and similar repositories for My-FunASR
Users that are interested in My-FunASR are comparing it to the libraries listed below
Sorting:
- 介绍docker、docker compose的使用。☆21Sep 4, 2024Updated last year
- LLaMA-Factory使用经验记录☆41Aug 26, 2024Updated last year
- Turn your Claude Code subscription to an OpenAI API compatible provider☆27Feb 20, 2026Updated last week
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated last year
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition☆14Sep 3, 2024Updated last year
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Jan 14, 2022Updated 4 years ago
- 使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。☆15Sep 6, 2024Updated last year
- 一个微博毒舌AI,疯狂 diss 微博博主☆15Jan 2, 2025Updated last year
- ☆22Jul 8, 2019Updated 6 years ago
- ☆11Feb 25, 2026Updated last week
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 8 months ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆190Nov 10, 2024Updated last year
- Create Slides with a simple MCP server using Python PPTX library☆35May 20, 2025Updated 9 months ago
- ☆49Nov 26, 2023Updated 2 years ago
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆51Jul 25, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- 从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库☆22Jul 31, 2021Updated 4 years ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆25Jul 1, 2024Updated last year
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- MFIN7036 NLP Course Project☆10Jul 25, 2024Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- ☆28Oct 7, 2025Updated 4 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆35Dec 12, 2024Updated last year
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- ☆23Updated this week
- ☆29Aug 8, 2024Updated last year
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- a simple lightweight large language model pipeline framework.☆28Apr 25, 2025Updated 10 months ago