yuekaizhang/minutes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yuekaizhang/minutes)

yuekaizhang / minutes

Podcast Summarizer with LLM Technology

☆30

Alternatives and similar repositories for minutes

Users that are interested in minutes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 10 months ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
yuekaizhang / Triton-ASR-Client
View on GitHub
ASR client for Triton ASR Service
☆39Jan 12, 2026Updated 6 months ago
FrancoisGrondin / gccphat
View on GitHub
☆17Oct 26, 2018Updated 7 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
TadaoYamaoka / RealtimeTranscribe
View on GitHub
real-time transcription application
☆12Jun 9, 2023Updated 3 years ago
winlinvip / srs-k2
View on GitHub
Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC
☆20Apr 16, 2023Updated 3 years ago
dengcunqin / noise-reduction
View on GitHub
noise reduction
☆17Jul 3, 2024Updated 2 years ago
multitel-ai / urban-sound-tagging
View on GitHub
1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context
☆17Dec 8, 2022Updated 3 years ago
winlinvip / ai-translation
View on GitHub
This solution is not good enough, we're researching a better version: https://github.com/winlinvip/vod-translator so we archive this repo…
☆21Apr 17, 2024Updated 2 years ago
k2-fsa / sherpa-mlx
View on GitHub
sherpa with mlx
☆15Aug 2, 2025Updated 11 months ago
yuhanghe01 / RiTTA
View on GitHub
Event Relation in Text-to-Audio (TTA) Generation
☆21Feb 26, 2025Updated last year
csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
liam-kelley / RIR-in-a-Box
View on GitHub
Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…
☆16Sep 1, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 3 years ago
kan-bayashi / Taco2withBERT
View on GitHub
Tacotron2 with BERT examples
☆10Jul 8, 2019Updated 7 years ago
ahmedshah1494 / speech_robust_bench
View on GitHub
☆18Apr 24, 2025Updated last year
BUTSpeechFIT / DeCRED
View on GitHub
☆18Aug 13, 2025Updated 11 months ago
haoheliu / DCASE_2022_Task_5
View on GitHub
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28Jul 6, 2022Updated 4 years ago
Ephrem-ETH / E2E-KWS
View on GitHub
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆45Nov 18, 2022Updated 3 years ago
R1ckShi / FrontEnd-AEC
View on GitHub
Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.
☆19Apr 22, 2019Updated 7 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
BUTSpeechFIT / SOT-DiCoW
View on GitHub
Multi-talker ASR based on DiCoW with Serialized Output Training
☆20Sep 18, 2025Updated 10 months ago
sithu31296 / audio-tagging
View on GitHub
Easy to use Audio Tagging in PyTorch
☆23Aug 22, 2021Updated 4 years ago
wxqwinner / silero-vad-ncnn
View on GitHub
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26Aug 21, 2024Updated last year
FrancoisGrondin / steernet
View on GitHub
☆27May 14, 2020Updated 6 years ago
ronggong / DCASE2017-task1
View on GitHub
Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1
☆11Aug 8, 2017Updated 8 years ago
mediatechlab / tts-wrapper
View on GitHub
TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.
☆20Jul 26, 2024Updated last year
rwth-i6 / rasr
View on GitHub
The RWTH ASR Toolkit.
☆59Updated this week
deepakacharyab / gnn_feature_selection_extraction
View on GitHub
☆15Oct 23, 2019Updated 6 years ago
yehudagale / fuzzyJoiner
View on GitHub
☆13Dec 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
desh2608 / css
View on GitHub
PyTorch implementation of Continuous Speech Separation
☆12Oct 5, 2022Updated 3 years ago
yyj2013 / webrtc_vad_for_mobile
View on GitHub
This is a effective VAD(Voice Activity Detection) for iOS & Android. It is port from google webrtc.
☆12Jul 13, 2017Updated 9 years ago
EvelynZhou / FAST-RIR
View on GitHub
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆12Nov 30, 2021Updated 4 years ago
willwade / tts-wrapper
View on GitHub
TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.
☆39Feb 20, 2026Updated 5 months ago
nmfisher / sherpa_onnx_dart
View on GitHub
Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter
☆22Jan 3, 2025Updated last year
echocatzh / Demo-of-DeepComplexAEC
View on GitHub
☆11Jun 15, 2022Updated 4 years ago
Jackson-Kang / Korean-phoneme-dictionary-generator
View on GitHub
Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)
☆13Feb 27, 2021Updated 5 years ago