vell001 / audio-annotator
Audio annotation tool
☆87Updated 4 years ago
Alternatives and similar repositories for audio-annotator
Users that are interested in audio-annotator are comparing it to the libraries listed below
- PyTorch reimplementation of Tacotron2 in Mandarin☆84Updated 4 years ago
- (Deprecated) WaveNet vocoder☆21Updated 5 years ago
- ☆30Updated 6 years ago
- A simple voiceprint model implemented with TensorFlow☆88Updated 6 years ago
- Notes compiled mainly from Hung-yi Lee's 2020 human language processing course materials, including code and slides☆33Updated 4 years ago
- A toolkit for speech segmentation based on BIC and neural networks such as BiLSTM☆123Updated 6 years ago
- Chinese TTS☆75Updated 5 years ago
- ASR for Chinese Mandarin☆76Updated 7 years ago
- DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)☆301Updated 5 years ago
- Tools for ASR Corpus Generation from Online Video☆140Updated 6 years ago
- A repository for Chinese text normalization.☆20Updated 4 years ago
- Mandarin ASR system based on TensorFlow☆108Updated 7 years ago
- Tacotron + Griffin-Lim Mandarin speech synthesis☆26Updated 2 years ago
- TensorFlow version of DFSMN☆49Updated 7 years ago
- A Demo of Mandarin/Chinese TTS frontend☆284Updated 3 years ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.☆410Updated 5 years ago
- The repo provides information about KeSpeech dataset.☆164Updated 3 years ago
- d-vector-based speaker recognition in Keras☆90Updated 5 years ago
- ☆15Updated 6 years ago
- A PyTorch-based end-to-end speech recognition system.☆116Updated 4 years ago
- An acoustic model built with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.☆70Updated 6 years ago
- 🔈 Voice activity detection in Python; this small program may be helpful for voice applications☆168Updated 7 years ago
- A CTC decoder for both online and offline ASR models☆64Updated 2 years ago
- ☆55Updated 5 years ago
- This project fine-tunes the Speech Denoising with Deep Feature Losses network on a Chinese speech dataset to get better denoising results on Chinese audio☆29Updated 6 years ago
- ☆40Updated 4 years ago
- This project extracts speech recognition training data from videos, for training automatic subtitle generation☆53Updated 7 years ago
- VAD (Voice Activity Detector): a Python implementation of endpoint detection on streaming data read in real time☆49Updated 10 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆67Updated 7 years ago
- Tacotron-2 (PyTorch) + MelGAN (PyTorch) Chinese TTS☆26Updated 2 years ago