yeyupiaoling / AudioClassification-Pytorch
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
☆464 Updated 3 weeks ago
Alternatives and similar repositories for AudioClassification-Pytorch:
Users that are interested in AudioClassification-Pytorch are comparing it to the libraries listed below
- Python code based on PyTorch for audio classification ☆45 Updated 2 years ago
- Speech emotion recognition implemented in PyTorch ☆180 Updated 2 weeks ago
- Audio classification implemented in PaddlePaddle, supporting EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods ☆91 Updated 3 weeks ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud… ☆934 Updated 3 weeks ago
- Sound classification implemented in TensorFlow; blog address: ☆99 Updated 4 years ago
- The dataset of Speech Recognition ☆409 Updated 3 months ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2) ☆664 Updated 11 months ago
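- A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with a variety of data augmentation methods. ☆653 Updated last week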
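- Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (completed), GMM-UBM, ivector, and deep-learning-based voiceprint recognition (self-attention completed). ☆90 Updated 2 years ago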
- Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit. ☆195 Updated 4 years ago
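- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank ☆259 Updated 3 weeks ago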
- Speech recognition using Python ☆147 Updated 3 years ago
- Voiceprint recognition implemented in TensorFlow ☆308 Updated 9 months ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit ☆521 Updated last month
- Audio Split: speech endpoint detection and speech segmentation based on the double-threshold method ☆132 Updated 4 years ago
- Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN ☆93 Updated 5 years ago
- Speech-related labs, companies, resources, internships, and more; recommendations and self-recommendations are welcome ☆549 Updated 4 months ago
- Practice exercises in speech emotion recognition and speaker identification, built on the CASIA database dataset ☆64 Updated 2 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta… ☆696 Updated last year
- A must-read paper for speech separation based on neural networks ☆773 Updated 2 years ago
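- This project cuts the RAVDESS dataset into 1 s short utterances and trains on them with openSMILE+CNN, aiming to classify each short utterance into one of four emotions: happy, sad, angry, and neutral. The final accuracy is about 76%. ☆56 Updated 3 years ago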
- Open-source tools for speaker recognition ☆613 Updated 7 months ago
- ☆147 Updated 2 years ago
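- Chinese speech recognition implemented in PaddlePaddle. The project is mature and recognition quality is good. It supports training and prediction on Windows and Linux, and prediction on Nvidia Jetson development boards. ☆724 Updated 3 months ago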
- Speech emotion recognition ☆35 Updated 3 weeks ago
- You can find the speech algorithms you want here ☆792 Updated 2 months ago
- Tutorial on speech signal processing experiments, with Python code ☆322 Updated 3 years ago
- speech enhancement\speech separation\sound source localization ☆1,106 Updated last year
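- End-to-end Chinese speech recognition implemented in PaddlePaddle, from getting started to practical use, with very simple introductory examples and very practical enterprise projects. Supports the currently most popular DeepSpeech2, Conformer, and Squeezeformer models ☆846 Updated last week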
- ☆418 Updated last year