vell001 / audio-annotator
Audio annotation tool
☆82Updated 3 years ago
Alternatives and similar repositories for audio-annotator
Users who are interested in audio-annotator are comparing it to the repositories listed below.
- (Deprecated) WaveNet vocoder☆21Updated 5 years ago
- Tools for ASR Corpus Generation from Online Video☆140Updated 6 years ago
- PyTorch reimplementation of Tacotron2 in Mandarin☆82Updated 4 years ago
- A pytorch based end2end speech recognition system.☆114Updated 4 years ago
- A simple model implemented with tensorflow for voiceprint☆88Updated 6 years ago
- Tensorflow version of DFSMN☆49Updated 6 years ago
- Materials compiled mainly from Prof. Hung-yi Lee's 2020 Human Language Processing course, including code and PPT slides☆35Updated 4 years ago
- The repo provides information about KeSpeech dataset.☆145Updated 2 years ago
- ☆29Updated 5 years ago
- A release version for https://github.com/athena-team/athena☆127Updated 2 years ago
- ASR for Chinese Mandarin☆75Updated 7 years ago
- Mandarin ASR system based on tensorflow☆108Updated 6 years ago
- Chinese speech recognition using CTC with Keras☆43Updated 6 years ago
- ☆143Updated 4 years ago
- A librosa STFT/Fbank/MFCC feature extraction written in PyTorch using 1D convolutions.☆75Updated 2 years ago
- ☆106Updated 4 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated last year
- An acoustic model built end to end, with characters as the modeling unit and a DCNN-CTC network architecture.☆70Updated 6 years ago
- ☆61Updated 2 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).☆69Updated 2 months ago
- chinese tts☆74Updated 4 years ago
- speech-aligner is a tool that generates phoneme-level time-aligned annotations from human speech and its text transcript.☆401Updated 5 years ago
- A speaker feature (voiceprint) extraction tool based on the pretrained VGG-SR model.☆33Updated 5 years ago
- DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)☆302Updated 4 years ago
- Speaker recognition based on dVector, in Keras☆90Updated 4 years ago
- A CTC decoder for both online and offline ASR models☆65Updated last year
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- Tacotron + Griffin-Lim synthesized Mandarin voice☆26Updated last year
- A summary of speech data augmentation algorithms☆68Updated 4 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago