Vaibhavs10/translate-with-whisper

☆157

Alternatives and similar repositories for translate-with-whisper

Users who are interested in translate-with-whisper are comparing it to the repositories listed below.

Vaibhavs10 / on-device-llm-playground
A repo with scripts to test and play around with Facebook's recent llama models! 🤗
☆28 · Jul 25, 2023 · Updated 3 years ago
fyvo / WMT-Biomed-Test
☆13 · Aug 23, 2024 · Updated last year
Narsil / hf-chat
☆25 · Dec 13, 2024 · Updated last year
kurianbenoy / whisper_normalizer
A python package for whisper normalizer
☆79 · Jul 17, 2026 · Updated last week
camenduru / background-replacement-colab
☆21 · Sep 27, 2023 · Updated 2 years ago
AIRI-Institute / AI4TALK
☆13 · Dec 7, 2022 · Updated 3 years ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 · Oct 2, 2024 · Updated last year
knoriy / CLARA
☆62 · Jul 25, 2024 · Updated 2 years ago
AI4Bharat / IndicVoices
☆19 · Feb 22, 2026 · Updated 5 months ago
miguelvalente / whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆137 · Aug 14, 2023 · Updated 2 years ago
Pwntus / replicate-narrator
AI narrator
☆15 · Nov 24, 2023 · Updated 2 years ago
alphacep / unimrcp-vosk-plugin
Open source cross-platform implementation of MRCP protocol
☆20 · Mar 3, 2022 · Updated 4 years ago
Bartelds / ctc-dro
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17 · May 16, 2025 · Updated last year
rlancemartin / karpathy-gpt
☆38 · Jun 19, 2023 · Updated 3 years ago
CodeAlchemyAI / yt-to-blog
☆15 · Sep 15, 2023 · Updated 2 years ago
multimodalart / grog
Gradio UI for a Cog API
☆71 · Apr 8, 2024 · Updated 2 years ago
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
Vaibhavs10 / notebooks
☆127 · Mar 19, 2025 · Updated last year
LAION-AI / Text-to-speech
☆61 · Nov 4, 2023 · Updated 2 years ago
fakerybakery / OpenF5-TTS
(WIP) A retrain of F5-TTS on permissively-licensed data
☆14 · Apr 6, 2025 · Updated last year
dioco-group / jenny-tts-dataset
A high-quality, varied ~30hr voice dataset suitable for training a TTS model
☆70 · Jan 7, 2023 · Updated 3 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27 · Feb 15, 2024 · Updated 2 years ago
huggingface / community-events
Place where folks can contribute to 🤗 community events
☆427 · Dec 7, 2023 · Updated 2 years ago
smallbraineng / smalltts
superfast text to speech in any voice
☆62 · Feb 16, 2026 · Updated 5 months ago
mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆35 · Jul 28, 2024 · Updated last year
mush42 / mantoq
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16 · Mar 15, 2025 · Updated last year
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77 · Jul 16, 2023 · Updated 3 years ago
SakanaAI / kame_finetune
☆30 · Jul 16, 2026 · Updated last week
moomou / listening-with-llm
☆17 · Jan 30, 2024 · Updated 2 years ago
mt-upc / ZeroSwot
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25 · Dec 12, 2024 · Updated last year
nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 · Nov 30, 2022 · Updated 3 years ago
mrcolo / longboii
☆18 · May 6, 2023 · Updated 3 years ago
sanchit-gandhi / whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
☆4,685 · Apr 3, 2024 · Updated 2 years ago
yuh-zha / Align
Align, a general text alignment function
☆15 · Dec 7, 2023 · Updated 2 years ago
naver-ai / RapFlow-TTS
☆56 · Jul 16, 2025 · Updated last year
Vaibhavs10 / dcase-2023-workshop
☆14 · Sep 20, 2023 · Updated 2 years ago
ALucek / rl-for-llms
Context & Guide For Reinforcement Learning with Verifiable Rewards with Large Language Models
☆19 · Nov 3, 2025 · Updated 8 months ago
Srijith-rkr / Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
☆271 · May 19, 2024 · Updated 2 years ago
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆121 · Jan 29, 2024 · Updated 2 years ago