yuekaizhang / minutes
Podcast Summarizer with LLM Technology
☆25Updated 5 months ago
Alternatives and similar repositories for minutes
Users that are interested in minutes are comparing it to the libraries listed below
- ☆33Updated 3 years ago
 - Python Wrapper of Silero VAD☆61Updated 5 months ago
 - Python runtime for WeTextProcessing (does not depend on Pynini)☆33Updated 3 weeks ago
 - Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆14Updated 10 months ago
 - Colab notebooks for Next-gen Kaldi☆29Updated 3 weeks ago
 - An enterprise-grade Voice Activity Detector from modelscope and funasr.☆114Updated 2 years ago
 - CTC decoder with hotwords for ASR.☆31Updated 6 months ago
 - Decoders from Kaldi using OpenFst☆34Updated 2 months ago
 - Convert anyone's voice timbre into thousands of different timbres☆32Updated 2 years ago
 - SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆91Updated last year
 - ☆20Updated 2 months ago
 - Project of Singing Voice Conversion.☆15Updated 2 years ago
 - ☆23Updated last year
 - Silero VAD (ncnn): pre-trained enterprise-grade Voice Activity Detector.☆22Updated last year
 - noise reduction☆17Updated last year
 - Chinese and English Bilingual G2P☆21Updated 2 years ago
 - An enterprise-grade Chinese-English code switch punctuator from funasr.☆28Updated last year
 - A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆42Updated 7 months ago
 - We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 4 months ago
 - [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆37Updated last year
 - Paraformer (Chinese ASR) online ONNX runtime for Python☆53Updated last year
 - Streaming Text to Speech Web UI☆22Updated last year
 - Performs inverse normalization on normalized Chinese text, i.e. implements the inverse of chinese_text_normalization.☆13Updated 4 years ago
 - video cut powered by AI☆25Updated 2 years ago
 - ☆12Updated 4 years ago
 - Port of Funasr's Paraformer model in C/C++☆35Updated last year
 - Separately maintained Chinese TTS☆35Updated 3 years ago
 - Cantonese Text to Speech with VITS implementation☆36Updated 2 years ago
 - The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
 - faster inference☆28Updated 9 months ago