★158 · Jun 26, 2023 · Updated 2 years ago
Alternatives and similar repositories for translate-with-whisper
Users that are interested in translate-with-whisper are comparing it to the libraries listed below.
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗 ★28 · Jul 25, 2023 · Updated 2 years ago
- ★13 · Aug 23, 2024 · Updated last year
- ★26 · Dec 13, 2024 · Updated last year
- A python package for whisper normalizer ★76 · Oct 6, 2025 · Updated 6 months ago
- ★13 · Dec 7, 2022 · Updated 3 years ago
- ★21 · Sep 27, 2023 · Updated 2 years ago
- ★357 · Mar 17, 2024 · Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ★18 · Oct 2, 2024 · Updated last year
- Align, a general text alignment function ★15 · Dec 7, 2023 · Updated 2 years ago
- ★62 · Jul 25, 2024 · Updated last year
- Python code which creates a semantic search bot over any available corpus ★17 · May 22, 2023 · Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ★137 · Aug 14, 2023 · Updated 2 years ago
- A Neural Vocoder supporting Ring Attention, Conformer and NSF. ★25 · Aug 1, 2025 · Updated 9 months ago
- ★17 · Jan 30, 2024 · Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol ★20 · Mar 3, 2022 · Updated 4 years ago
- AI narrator ★15 · Nov 24, 2023 · Updated 2 years ago
- Place where folks can contribute to 🤗 community events ★429 · Dec 7, 2023 · Updated 2 years ago
- ★15 · Sep 15, 2023 · Updated 2 years ago
- ★38 · Jun 19, 2023 · Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ★33 · Apr 22, 2026 · Updated last week
- ★21 · Jul 15, 2024 · Updated last year
- Repository for all things LLM related ★13 · Dec 31, 2024 · Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ★27 · Feb 15, 2024 · Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model ★68 · Jan 7, 2023 · Updated 3 years ago
- ★61 · Nov 4, 2023 · Updated 2 years ago
- Text-to-Speech Benchmark ★24 · Apr 2, 2026 · Updated last month
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ★33 · Jul 28, 2024 · Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data ★14 · Apr 6, 2025 · Updated last year
- ★128 · Mar 19, 2025 · Updated last year
- Gradio UI for a Cog API ★70 · Apr 8, 2024 · Updated 2 years ago
- ★19 · May 6, 2023 · Updated 2 years ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★4,688 · Apr 3, 2024 · Updated 2 years ago
- ★17 · Aug 5, 2025 · Updated 9 months ago
- Project for HIDING SPEAKER'S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ★15 · Nov 30, 2022 · Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ★77 · Jul 16, 2023 · Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ★25 · Dec 12, 2024 · Updated last year
- ★14 · Sep 20, 2023 · Updated 2 years ago
- ★560 · Jul 10, 2024 · Updated last year
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model ★10 · Aug 24, 2025 · Updated 8 months ago