yuekaizhang / minutesView external linksLinks
Podcast Summarizer with LLM Technology
☆30May 28, 2025Updated 8 months ago
Alternatives and similar repositories for minutes
Users that are interested in minutes are comparing it to the libraries listed below
Sorting:
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 5 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆16Apr 24, 2025Updated 9 months ago
- ☆17Oct 26, 2018Updated 7 years ago
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆28Feb 27, 2025Updated 11 months ago
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated 11 months ago
- ☆20Jul 22, 2022Updated 3 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Apr 16, 2023Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆43Nov 18, 2022Updated 3 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆24Aug 21, 2024Updated last year
- This solution is not good enough, we're researching a better version: https://github.com/winlinvip/vod-translator so we archive this repo…☆21Apr 17, 2024Updated last year
- ☆25May 14, 2020Updated 5 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter☆22Jan 3, 2025Updated last year
- Easy to use Audio Tagging in PyTorch☆23Aug 22, 2021Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 5 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- The RWTH ASR Toolkit.☆58Updated this week
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- ☆28Oct 7, 2025Updated 4 months ago
- A benchmark for evaluating audio encoders on various audio tasks.☆43Dec 11, 2025Updated 2 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆141Oct 9, 2025Updated 4 months ago
- This is the official implementation of reverberant speech to room impulse response estimator☆40Aug 7, 2024Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆35Updated this week
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- ☆33Nov 29, 2022Updated 3 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 8 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 2 months ago
- Neural Dereverberation☆36May 22, 2019Updated 6 years ago
- ASR client for Triton ASR Service☆37Jan 12, 2026Updated last month