datawhalechina / hugging-audio
Chinese edition of the Hugging Face Audio Course, helping learners get started quickly with the audio modality
☆37Updated last year
Alternatives and similar repositories for hugging-audio
Users that are interested in hugging-audio are comparing it to the libraries listed below
- A minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on the Transformer☆82Updated last year
- A Chinese tutorial on Git☆158Updated last year
- ☆28Updated 4 months ago
- A collection of currently available open-source Chinese dialogue datasets☆187Updated 2 years ago
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆188Updated last year
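- Unlocking the many uses of the HuggingFace ecosystem☆97Updated 11 months ago
- Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations welcome☆582Updated last year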
- MFCC implementation with detailed comments.☆17Updated 4 years ago
- ☆204Updated last year
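- Companion materials for a Bilibili video course☆39Updated 2 years ago
- Notes compiled mainly from Prof. Hung-yi Lee's 2020 Human Language Processing course materials, including code and PPT slides☆34Updated 4 years ago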
- Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.☆151Updated 5 months ago
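- ASR tutorial: https://dataxujing.github.io/ASR-paper/☆25Updated last year
- This course is aimed at NLP practitioners who have some machine learning background but are new to the field or have limited experience. It tries to avoid tedious formula derivations, uses code to show the design ideas behind each model, and traces the technical evolution of each module, so that learners see both the trees and the forest.☆90Updated last year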
- ☆21Updated 2 years ago
- Datawhale paper sharing: reading cutting-edge papers and sharing technical innovations☆51Updated last year
- Speech recognition: cutting-edge papers☆50Updated 3 years ago
- 🤗 R1-AQA Model: mispeech/r1-aqa☆306Updated 7 months ago
- TinyRAG☆368Updated 4 months ago
- Theory and practice of large model (LLM) inference and deployment☆358Updated 4 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLMs☆165Updated 3 weeks ago
- A PyTorch implementation of sound classification supporting EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari…☆555Updated 6 months ago
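- This repository walks you through building traditional NLP neural networks from scratch using PyTorch linear layers☆42Updated 11 months ago
- Python-based speech recognition service deployment; any ASR model interface that supports single-utterance decoding can follow this framework to deploy its own speech recognition service☆52Updated 3 years ago
- A reproduction of the llama-omni training code☆69Updated 10 months ago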
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆356Updated 5 months ago
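- Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (done), GMM-UBM, ivector, and deep-learning-based speaker recognition (self-attention done).☆103Updated 2 years ago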
- ☆167Updated last year
- Building a MiniLLM from scratch (pretrain+sft+dpo, in progress)☆500Updated 8 months ago
- A beginner's tutorial on model compression; PDF download: https://github.com/datawhalechina/awesome-compression/releases☆337Updated 5 months ago