AISHELL开源数据标注平台,包含语音,图像标注,数据质检,验收,统计等功能.
☆25Dec 23, 2019Updated 6 years ago
Alternatives and similar repositories for aishell_annotation
Users that are interested in aishell_annotation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Mar 25, 2021Updated 4 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- A dataset of pitch curves for music performance assessment☆10Jun 5, 2023Updated 2 years ago
- Inference SAM in C # based on OpenVINO, ONNX runtime, TensorRT☆19Jun 6, 2024Updated last year
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆64Nov 5, 2025Updated 4 months ago
- End-to-end speech recognition on AISHELL dataset.☆34Nov 9, 2021Updated 4 years ago
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"☆36Feb 10, 2026Updated last month
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- ☆91Dec 10, 2024Updated last year
- Mining effective negative training samples for keyword spotting (PyTorch)☆64May 23, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- ☆50Dec 26, 2020Updated 5 years ago
- Simple Kalman filter library for Arduino☆20May 30, 2019Updated 6 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆42Jan 17, 2025Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- ☆11Apr 5, 2023Updated 2 years ago
- Chinese Speech SDK for Android, iOS and embedded Linux platforms. http://ai.mobvoi.com☆23Jun 16, 2020Updated 5 years ago
- ☆12Oct 25, 2015Updated 10 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)☆33Sep 12, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- High quality slow motion video generation using deep neural nets and optical flow.☆10Jul 13, 2018Updated 7 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆25Mar 7, 2026Updated 2 weeks ago
- This released code is for our ACL2018 paper "End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions". …☆15May 28, 2018Updated 7 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- ☆13Mar 22, 2015Updated 11 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 6 months ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- 高通AR的demo☆14Nov 25, 2016Updated 9 years ago
- Implements a proof-of-concept of a multi-level clustering algorithm designed to enable extremely fast approximate match search in a large…☆12Feb 24, 2013Updated 13 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆76Jun 16, 2025Updated 9 months ago
- ObamaNet fork☆12Sep 16, 2019Updated 6 years ago
- train code of sepconv☆10Feb 20, 2019Updated 7 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year