datawhalechina / hugging-audio
Hugging Face Audio Course中文版,帮助学习者快速入门音频模态
☆31Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for hugging-audio
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆109Updated last week
- ASR教程: https://dataxujing.github.io/ASR-paper/☆23Updated 4 months ago
- MFCC implementation with detailed comments.☆16Updated 3 years ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆27Updated last month
- 用于汇总目前的开源中文对话数据集☆116Updated last year
- An minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer☆47Updated 6 months ago
- unofficial implementation of the High Fidelity Neural Audio Compression☆136Updated 3 months ago
- ☆173Updated last month
- flow mirror models from JZX AI Labs☆40Updated last month
- 语音识别 论文 前沿☆43Updated 2 years ago
- ☆15Updated 2 years ago
- Papers of ASR, Tools of ASR☆38Updated last year
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…☆370Updated 9 months ago
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark☆146Updated 5 months ago
- Pseudo Streaming SenseVoice with Hotwords☆85Updated 2 weeks ago
- 与Datawhale组织的现有仓库以及学习内容对话——快速找到你想学习的内容和贡献内容!☆29Updated 7 months ago
- a chinese tutorial of git☆146Updated 7 months ago
- 语音方向实验室/公司/资源/实习等,欢迎推荐或自荐☆524Updated last week
- 大模型/LLM推理和部署理论与实践☆81Updated this week
- 基于python的语音识别服务部署,任何一个支持一句话解码的ASR模型接口,都可仿照该框架部署自己的语音识别服务☆50Updated 2 years ago
- 主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt☆33Updated 3 years ago
- ☆13Updated last month
- This repository is the official implementation of the ECAI 2024 conference paper SUBLLM: A Novel Efficient Architecture with Token Sequen…☆68Updated 3 months ago
- 解锁HuggingFace生态的百般用法☆68Updated this week
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆35Updated last year
- ☆12Updated 3 months ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆144Updated last month
- The repo provides information about KeSpeech dataset.☆115Updated 2 years ago
- Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.☆201Updated 10 months ago
- 从零实现一个小参数量中文大语言模型。☆279Updated 3 months ago