☆158Jun 26, 2023Updated 2 years ago
Alternatives and similar repositories for translate-with-whisper
Users that are interested in translate-with-whisper are comparing it to the libraries listed below
Sorting:
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗☆28Jul 25, 2023Updated 2 years ago
- ☆13Aug 23, 2024Updated last year
- ☆21Sep 27, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- ☆27Dec 13, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 9 months ago
- A python package for whisper normalizer☆75Oct 6, 2025Updated 5 months ago
- ☆62Jul 25, 2024Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 11 months ago
- ☆15Sep 15, 2023Updated 2 years ago
- AI narrator☆15Nov 24, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- ☆21Mar 13, 2023Updated 2 years ago
- Text-to-Speech Latency Benchmark☆22Jan 16, 2026Updated last month
- ☆21Jul 15, 2024Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆138Aug 14, 2023Updated 2 years ago
- ☆357Mar 17, 2024Updated last year
- Place where folks can contribute to 🤗 community events☆429Dec 7, 2023Updated 2 years ago
- ☆19May 6, 2023Updated 2 years ago
- ☆61Nov 4, 2023Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆67Jan 7, 2023Updated 3 years ago
- Gradio UI for a Cog API☆70Apr 8, 2024Updated last year
- ☆127Mar 19, 2025Updated 11 months ago
- ☆17Aug 5, 2025Updated 7 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- [WIP] AI Try-On plugin for Chrome☆28Mar 16, 2024Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- ☆40Mar 25, 2024Updated last year
- [Tutorial] Demystifying Natural Language Processing with Python☆23Sep 7, 2019Updated 6 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- ☆558Jul 10, 2024Updated last year
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆13Oct 25, 2024Updated last year
- ☆14Sep 20, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago