Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
☆73Mar 19, 2026Updated this week
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Faster Whisper ASR transcription with CTranslate2☆24Oct 25, 2024Updated last year
- AI Subtitle Translator relies on AI Models like ChatGPT 4O, Claude 3.7 Sonnet, DeepSeek V3 and Gemini 2.0 to translate your favorite movi…☆10Nov 7, 2025Updated 4 months ago
- Edit by Color by KIRI Engine☆22Nov 7, 2025Updated 4 months ago
- minimal module for launching compute clusters☆24Oct 22, 2015Updated 10 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- BIS (Blender Interplanety Storage) - online materials/shaders library for 3D creation Blender☆21Oct 18, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆23Oct 17, 2024Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 11 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 9 months ago
- ☆17Apr 14, 2023Updated 2 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- ☆13Sep 12, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Diffusion Probabilistic Model in Jax☆13Apr 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆67Aug 16, 2023Updated 2 years ago
- JAX Implementations of Descript Audio Codec and EnCodec☆34Mar 30, 2025Updated 11 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆34Jul 31, 2024Updated last year
- Short-time Fourier transform (STFT) for JAX☆15Dec 20, 2021Updated 4 years ago
- Websocket controlled Video Overlay server for OBS-Studio, XSplit, CasparCG, ProPresenter and everything with web browser.☆27Sep 14, 2024Updated last year
- ViSH Editor is an HTML5 application to create web presentations in a simple and friendly way.☆48Jun 10, 2020Updated 5 years ago
- ☆20Sep 20, 2024Updated last year