BBC-Esq/WhisperS2T-transcriber

Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files

☆78

Alternatives and similar repositories for WhisperS2T-transcriber

Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below.

shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
☆577 · Aug 27, 2024 · Updated last year
becem-gharbi / directus-starter
Directus starter for Nuxt 3
☆12 · Mar 18, 2024 · Updated 2 years ago
mwt / monitor-dnsupdate-cf
Monitor a site with two origin servers and rotate DNS when the primary one goes down.
☆12 · May 20, 2022 · Updated 4 years ago
mobiusml / faster-whisper
Faster Whisper ASR transcription with CTranslate2
☆25 · Oct 25, 2024 · Updated last year
BBC-Esq / PyQt6-PDF-Viewer
A simple PDF viewer created with PyQt6 that you can use by itself or incorporate in other scripts. Hard to find!
☆17 · Mar 6, 2025 · Updated last year
ironmansoftware / restore
Ctrl+Shift+T for PowerShell Terminals
☆12 · Sep 28, 2020 · Updated 5 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆27 · Sep 24, 2023 · Updated 2 years ago
yxlu-0102 / IDEA-TTS
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27 · Mar 21, 2025 · Updated last year
pelikhan / action-continuous-translation
Automatically maintains translation of markdown files using GenAI.
☆15 · Updated this week
kaistmm / voxsim_trainer
[INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset
☆24 · Sep 29, 2025 · Updated 10 months ago
ina-foss / InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
☆16 · Jan 20, 2025 · Updated last year
alphacep / whisper-prompts
OpenAI Whisper Prompt Examples
☆53 · Jul 17, 2023 · Updated 3 years ago
alphacep / unimrcp-vosk-plugin
Open source cross-platform implementation of MRCP protocol
☆20 · Mar 3, 2022 · Updated 4 years ago
Deep-unlearning / nano-cohere-transcribe
Pure-PyTorch inference for CohereLabs/cohere-transcribe-03-2026 (2B Conformer + Transformer ASR, 14 languages).
☆37 · Apr 29, 2026 · Updated 3 months ago
gweltou / anaouder-cli
Voice recognizer in Breton with Vosk
☆15 · Nov 24, 2025 · Updated 8 months ago
liuhuang31 / g2pw_once
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14 · Dec 30, 2023 · Updated 2 years ago
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 · Apr 10, 2025 · Updated last year
vtempest / tesseract-ocr-sample
Tesseract OCR Sample (Visual Studio) with Leptonica Preprocessing.
☆21 · Feb 11, 2019 · Updated 7 years ago
zhou-feifei / parakeet-tdt-0.6b-v2-Batch-Transcriber
A high-performance batch audio transcription tool using nvidia/parakeet-tdt-0.6b-v2 to generate accurate, well-segmented SRT subtitles, w…
☆18 · Dec 9, 2025 · Updated 7 months ago
silencelamb / naked_llama
Build llama inference compute from scratch, using only torch/numpy base ops
☆16 · May 5, 2026 · Updated 2 months ago
DiffAPF / LA-2A
Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".
☆24 · Jun 10, 2024 · Updated 2 years ago
xjuspeech / YOLOPitch
☆10 · Jun 11, 2024 · Updated 2 years ago
ozspeech / OZSpeech
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45 · Feb 9, 2025 · Updated last year
bekirbakar / replay-attack-detection
Deep learning-based audio spoofing attack detection experiments for speaker verification.
☆14 · Apr 20, 2023 · Updated 3 years ago
huangruizhe / audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10 · Sep 30, 2024 · Updated last year
pengzhendong / audio-pipeline
☆23 · Oct 17, 2024 · Updated last year
daanzu / kaldi_ag_training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21 · Jan 24, 2022 · Updated 4 years ago
tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0
☆18 · Mar 13, 2024 · Updated 2 years ago
Wataru-Nakata / ssl-vocoders
Implementation of vocoders empowered with PyTorch Lightning
☆18 · Jan 27, 2024 · Updated 2 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21 · Sep 18, 2023 · Updated 2 years ago
iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆35 · Mar 31, 2023 · Updated 3 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
light1726 / BetaVAE_VC
Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE"
☆43 · Apr 10, 2023 · Updated 3 years ago
backspacetg / distilXLSR
Models and code for the INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13 · Mar 30, 2025 · Updated last year
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
iiscleap / ZEST
Zero-Shot Emotion Style Transfer
☆49 · Apr 23, 2025 · Updated last year
ihtfw / TeamViewer.QuickSupport.Integration
TeamViewer QuickSupport Integration for .NET applications
☆11 · Jan 20, 2022 · Updated 4 years ago
virex-84 / VoskIdentification
Test example of using a model for voice identification with the "Vosk" speech recognition library: https://alphacephei…
☆12 · Aug 14, 2023 · Updated 2 years ago
gwh22 / LAFMA
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)
☆44 · Jun 13, 2024 · Updated 2 years ago