β158Jun 26, 2023Updated 2 years ago
Alternatives and similar repositories for translate-with-whisper
Users that are interested in translate-with-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A repo with scripts to test and play around with Facebook's recent llama models! π€β28Jul 25, 2023Updated 2 years ago
- β13Aug 23, 2024Updated last year
- This contains a practical guide for non-technical users on how to use OpenAI's Whisper for transcription and translationβ12May 8, 2024Updated 2 years ago
- β26Dec 13, 2024Updated last year
- A python package for whisper normalizerβ79Oct 6, 2025Updated 8 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- β13Dec 7, 2022Updated 3 years ago
- β21Sep 27, 2023Updated 2 years ago
- β357Mar 17, 2024Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Unitsβ18Oct 2, 2024Updated last year
- Align, a general text alignment functionβ15Dec 7, 2023Updated 2 years ago
- β62Jul 25, 2024Updated last year
- Python code which creates a semantic search bot over any available corpusβ17May 22, 2023Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Aug 14, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.β25Aug 1, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β17Jan 30, 2024Updated 2 years ago
- Open source cross-platform implementation of MRCP protocolβ20Mar 3, 2022Updated 4 years ago
- β15Sep 15, 2023Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β17May 16, 2025Updated last year
- β38Jun 19, 2023Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.β34Apr 22, 2026Updated last month
- β21Jul 15, 2024Updated last year
- Repository for all things LLM relatedβ13Dec 31, 2024Updated last year
- Kaazing Websocket Gateway integrated with the RaspberryPiβ30Nov 20, 2013Updated 12 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Feb 15, 2024Updated 2 years ago
- β61Nov 4, 2023Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS modelβ69Jan 7, 2023Updated 3 years ago
- Text-to-Speech Benchmarkβ26Apr 2, 2026Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisperβ34Jul 28, 2024Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed dataβ14Apr 6, 2025Updated last year
- β127Mar 19, 2025Updated last year
- Gradio UI for a Cog APIβ70Apr 8, 2024Updated 2 years ago
- β19May 6, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,686Apr 3, 2024Updated 2 years ago
- Project for HIDING SPEAKERβS SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINEβ15Nov 30, 2022Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" accβ¦β77Jul 16, 2023Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translationβ25Dec 12, 2024Updated last year
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.β66Sep 8, 2025Updated 9 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ121Jan 29, 2024Updated 2 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correctionβ271May 19, 2024Updated 2 years ago