wangzhaode / mnn-asrView external linksLinks
mnn asr demo.
☆25Mar 24, 2025Updated 10 months ago
Alternatives and similar repositories for mnn-asr
Users that are interested in mnn-asr are comparing it to the libraries listed below
Sorting:
- mnn tts demo.☆19May 7, 2025Updated 9 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- segment-anything based mnn☆36Dec 13, 2023Updated 2 years ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Oct 15, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- 很好用的tnn classify demo☆11Mar 24, 2021Updated 4 years ago
- ☆15Jul 14, 2020Updated 5 years ago
- ☆10Sep 2, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- ☆12Feb 5, 2024Updated 2 years ago
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 2 months ago
- Openfst mirror with some fixes☆14Aug 23, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated last week
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- ☆11Mar 22, 2023Updated 2 years ago
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- DPDFNet: causal single-channel speech enhancement that boosts DeepFilterNet2 with dual-path RNN blocks for stronger long-range temporal a…☆30Updated this week
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…☆11Aug 4, 2023Updated 2 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- stable diffusion using mnn☆67Sep 28, 2023Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Jun 6, 2024Updated last year
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 10 months ago
- ☆12Jan 25, 2023Updated 3 years ago
- Tracking beer/wine using Audio Event Detection with Machine Learning☆15Jun 16, 2024Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago