Whisper combined with Silero VAD, for improved long-form transcriptions
☆54 · Dec 11, 2022 · Updated 3 years ago
Alternatives and similar repositories for WhisperWithVAD
Users that are interested in WhisperWithVAD are comparing it to the libraries listed below.
- Robust Speech Recognition via Large-Scale Weak Supervision ☆19 · Dec 1, 2022 · Updated 3 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 · Oct 19, 2022 · Updated 3 years ago
- Joint speech-language model - respond directly to audio! ☆30 · May 13, 2024 · Updated last year
- ☆14 · Feb 18, 2022 · Updated 4 years ago
- A streaming whisper server for on-prem transcription ☆23 · Aug 15, 2024 · Updated last year
- Simple LPC vocoder in Python ☆13 · Jan 7, 2022 · Updated 4 years ago
- generate granular word-level captions in srt format ☆57 · Sep 26, 2022 · Updated 3 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆348 · Nov 12, 2024 · Updated last year
- ☆14 · Feb 9, 2023 · Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. ☆14 · Sep 19, 2022 · Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆13 · Oct 28, 2023 · Updated 2 years ago
- ☆38 · Dec 26, 2022 · Updated 3 years ago
- Datasets for turn-taking research ☆19 · Dec 21, 2023 · Updated 2 years ago
- ☆10 · Oct 25, 2019 · Updated 6 years ago
- ☆14 · Jul 8, 2020 · Updated 5 years ago
- ☆11 · May 23, 2023 · Updated 2 years ago
- convert .lab files to .TextGrid files, which can be used in Praat ☆14 · Nov 2, 2018 · Updated 7 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE ☆13 · Mar 31, 2021 · Updated 4 years ago
- One command to start a streaming ASR server. ☆12 · Oct 2, 2024 · Updated last year
- NVIDIA's TalkNET - Train and Synthesize on colab ☆15 · Dec 6, 2025 · Updated 3 months ago
- ☆49 · Apr 28, 2023 · Updated 2 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆21 · May 17, 2023 · Updated 2 years ago
- A simple booking system, developed in screenful-sized steps ☆13 · Oct 1, 2020 · Updated 5 years ago
- audiolm-pytorch training code ☆15 · Jul 31, 2023 · Updated 2 years ago
- This sample code collects the code covered in 「ゼロから学ぶスパイキングニューラルネットワーク」 (roughly, "Learning Spiking Neural Networks from Scratch"). ☆18 · Jan 2, 2021 · Updated 5 years ago
- Compute WER and SER for speech recognition evaluation ☆27 · Mar 18, 2026 · Updated last week
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version ☆16 · Jun 27, 2025 · Updated 9 months ago
- Hpyformer base FunASR ☆30 · Nov 5, 2024 · Updated last year
- Simple API for CosyVoice speech synthesis ☆14 · Nov 1, 2024 · Updated last year
- A simple API version of FunASR speech-to-text (funasr + fastapi), easy to deploy on a server ☆13 · Aug 10, 2024 · Updated last year
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog ☆65 · May 18, 2024 · Updated last year
- ☆11 · Dec 24, 2024 · Updated last year
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- A speaker-diarization-based speech recognition API implemented using FunASR. ☆23 · Feb 12, 2026 · Updated last month
- A cheap and simple runtime Animator for skeletal meshes in Unity. ☆20 · Oct 2, 2024 · Updated last year
- 🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for the scalable evaluation of speech and t… ☆23 · Mar 16, 2026 · Updated last week
- ☆14 · Aug 9, 2021 · Updated 4 years ago
- <Integrated> An AI program that uses FunASR for speech recognition, calls the Qwen large model to answer, and outputs speech through GPTSovits; the model is currently called online, and an offline large model will be added later ☆13 · Nov 30, 2024 · Updated last year
- Code repository for Qlik Sense Cookbook, published by Packt ☆12 · Jan 18, 2023 · Updated 3 years ago