Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
☆74Mar 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Faster Whisper ASR transcription with CTranslate2☆24Oct 25, 2024Updated last year
- Directus starter for Nuxt 3☆12Mar 18, 2024Updated 2 years ago
- AI Subtitle Translator relies on AI Models like ChatGPT 4O, Claude 3.7 Sonnet, DeepSeek V3 and Gemini 2.0 to translate your favorite movi…☆11Nov 7, 2025Updated 5 months ago
- Edit by Color by KIRI Engine☆22Nov 7, 2025Updated 5 months ago
- minimal module for launching compute clusters☆24Oct 22, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Monitor a site with two origin servers and rotate DNS when the primary one goes down.☆12May 20, 2022Updated 3 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- BIS (Blender Interplanety Storage) - online materials/shaders library for 3D creation Blender☆21Oct 18, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 3 years ago
- ☆14Aug 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆13Sep 29, 2025Updated 6 months ago
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 11 months ago
- A Hypertrie wrapper that supports mounting of other Hypertries☆25Oct 9, 2020Updated 5 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 9 months ago
- A simple, accessible and offline real-time transcription app for Android.☆14Oct 1, 2024Updated last year
- ☆17Apr 14, 2023Updated 3 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- ☆13Sep 12, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 8 months ago
- Diffusion Probabilistic Model in Jax☆13Apr 20, 2024Updated last year
- ☆68Aug 16, 2023Updated 2 years ago
- JAX Implementations of Descript Audio Codec and EnCodec☆35Mar 30, 2025Updated last year
- Short-time Fourier transform (STFT) for JAX☆15Dec 20, 2021Updated 4 years ago