vell001 / audio-annotator
Audio annotation tool
☆77Updated 3 years ago
Alternatives and similar repositories for audio-annotator:
Users who are interested in audio-annotator are comparing it to the libraries listed below.
- PyTorch reimplementation of Tacotron2 in Mandarin☆81Updated 3 years ago
- (Deprecated) WaveNet vocoder☆21Updated 4 years ago
- A librosa STFT/Fbank/MFCC feature extraction written in PyTorch using 1D convolutions.☆74Updated 2 years ago
- A simple voiceprint model implemented with TensorFlow☆87Updated 6 years ago
- Tacotron + Griffin-Lim synthesized Mandarin voice☆26Updated last year
- Notes compiled mainly from Prof. Hung-yi Lee's 2020 Human Language Processing course materials, including code and slides (PPT)☆35Updated 3 years ago
- DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)☆300Updated 4 years ago
- ASR for Chinese Mandarin☆75Updated 6 years ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its transcript text.☆397Updated 4 years ago
- Tools for ASR Corpus Generation from Online Video☆139Updated 6 years ago
- This project fine-tunes the Speech Denoising with Deep Feature Losses network on a Chinese speech dataset to obtain better denoising results on Chinese audio☆27Updated 5 years ago
- ☆106Updated 3 years ago
- Listen, Attend and Spell model and a pretrained Chinese Mandarin ASR model☆122Updated last year
- Python code to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- Voice Activity Detector☆72Updated 2 years ago
- Dialect classification, PyTorch☆41Updated 6 years ago
- Speaker feature (voiceprint) extraction tool based on the pretrained VGG-SR model.☆33Updated 4 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- A toolkit for speech segmentation based on BIC and neural networks such as BiLSTM☆122Updated 5 years ago
- Some basic features used in speech processing and sound source localization☆50Updated 6 years ago
- Kaldi-based small-vocabulary Chinese speech recognition, trained with a DNN☆27Updated 6 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆117Updated 2 years ago
- Speaker recognition based on dVector, in Keras☆88Updated 4 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆165Updated 2 years ago
- ☆142Updated 4 years ago
- ☆50Updated 4 years ago
- Tacotron-2 (PyTorch) + MelGAN (PyTorch) Chinese TTS☆26Updated last year
- TensorFlow version of DFSMN☆49Updated 6 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.