Whisper combined with Silero VAD, for improved long-form transcriptions
☆54 Dec 11, 2022 Updated 3 years ago
Alternatives and similar repositories for WhisperWithVAD
Users that are interested in WhisperWithVAD are comparing it to the libraries listed below
- zero shot NER fine tuning ☆14 Mar 17, 2025 Updated 11 months ago
- Collaborative transcription service that keeps getting better ☆23 Nov 8, 2023 Updated 2 years ago
- A streaming whisper server for on-prem transcription ☆23 Aug 15, 2024 Updated last year
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022) ☆22 Jun 5, 2025 Updated 9 months ago
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023) ☆21 Oct 28, 2023 Updated 2 years ago
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023. ☆26 Nov 7, 2023 Updated 2 years ago
- ☆49 Apr 28, 2023 Updated 2 years ago
- Tistory photo grabber ☆24 Nov 13, 2021 Updated 4 years ago
- Open Source Python SDK for AI Agents Identity ☆34 Jan 20, 2026 Updated last month
- Solution to Data Analytics Essentials course by Cisco ☆13 Dec 26, 2023 Updated 2 years ago
- Joint speech-language model - respond directly to audio! ☆30 May 13, 2024 Updated last year
- AviSynth CUDA Filters ☆35 Mar 24, 2019 Updated 6 years ago
- Download, browse and delete models in ComfyUI. ☆12 Oct 9, 2024 Updated last year
- attempt to perma root the NEC Terrain android phone ☆10 Jul 24, 2015 Updated 10 years ago
- ☆86 Jul 31, 2025 Updated 7 months ago
- SR-VAE ☆10 Jul 26, 2021 Updated 4 years ago
- A Python script for AI speech recognition of video or audio file using whisper, stable-ts or faster-whisper and translation of subtitle u… ☆10 Feb 17, 2025 Updated last year
- Code for my collection of predictors/classifiers/etc ☆14 Jul 18, 2024 Updated last year
- Anime Character Segmentation ☆11 Aug 31, 2020 Updated 5 years ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :) ☆17 Updated this week
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 Nov 18, 2022 Updated 3 years ago
- Steampunk geese run a parallel processing commune. Surprisingly effective. ☆51 Feb 25, 2026 Updated last week
- ☆14 Dec 3, 2025 Updated 3 months ago
- ISFMXFW - UI Enhancer For Inno Setup ☆10 Apr 4, 2023 Updated 2 years ago
- Open-source, local-first AI command center. High-performance Rust alternative to Zapier, n8n, and OpenClaw. Orchestrate 25k+ tools using … ☆34 Updated this week
- The GitHub open source software repository on interpreting super-resolution CNNs for sub-pixel motion compensation in video coding ☆11 May 20, 2022 Updated 3 years ago
- ☆15 Jan 17, 2026 Updated last month
- Playground site for creating/validating data contracts ☆11 Aug 9, 2025 Updated 6 months ago
- A local-first memory layer for AI (Cursor, Zed, Claude). Persistent architectural context via semantic search. ☆105 Feb 25, 2026 Updated last week
- HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022 ☆11 Sep 7, 2022 Updated 3 years ago
- Short-duration online speech recognition service based on wenet ☆11 Feb 25, 2023 Updated 3 years ago
- A basic WEB chat based on Haskell, HTMX and Tailwindcss ☆11 Sep 27, 2022 Updated 3 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)… ☆16 May 26, 2022 Updated 3 years ago
- ☆11 Feb 6, 2022 Updated 4 years ago
- Semi-automated process to create an audiobook (m4b format) from markdown files. ☆11 Jan 12, 2017 Updated 9 years ago
- Details of the datasets for Few-shot class-incremental audio classification ☆11 Dec 6, 2023 Updated 2 years ago
- s-grid is a helper class useful for creating responsive, fluid grid layouts using CSS Custom Properties. ☆10 Feb 22, 2017 Updated 9 years ago
- ☆12 Mar 17, 2020 Updated 5 years ago
- Computational time vs quality comparison between some Edge preserving smoothing filters ☆10 May 5, 2017 Updated 8 years ago