sanchit-gandhi/whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

☆4,685

Alternatives and similar repositories for whisper-jax

Users who are interested in whisper-jax are comparing it to the libraries listed below.

SYSTRAN / faster-whisper
Faster Whisper transcription with CTranslate2
☆24,609 • Nov 19, 2025 • Updated 8 months ago
huggingface / distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,099 • Jan 8, 2025 • Updated last year
suno-ai / bark
🔊 Text-Prompted Generative Audio Model
☆39,215 • Aug 19, 2024 • Updated last year
m-bain / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,320 • Jul 13, 2026 • Updated 2 weeks ago
ggml-org / whisper.cpp
Port of OpenAI's Whisper model in C/C++
☆52,406 • Updated this week
Vaibhavs10 / insanely-fast-whisper
☆12,997 • Oct 25, 2025 • Updated 9 months ago
openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆106,032 • Updated this week
pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,351 • Updated this week
Stability-AI / StableLM
StableLM: Stability AI Language Models
☆15,685 • Apr 8, 2024 • Updated 2 years ago
facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,826 • Updated this week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,508 • May 1, 2026 • Updated 2 months ago
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,836 • Aug 16, 2024 • Updated last year
OpenNMT / CTranslate2
Fast inference engine for Transformer models
☆4,596 • Jul 3, 2026 • Updated 3 weeks ago
facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,527 • Mar 3, 2026 • Updated 4 months ago
AIGC-Audio / AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
☆10,171 • Jul 6, 2024 • Updated 2 years ago
run-llama / llama_index
LlamaIndex is the leading document agent and OCR platform
☆51,196 • Updated this week
MahmoudAshraf97 / whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,614 • Feb 23, 2026 • Updated 5 months ago
LAION-AI / Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,396 • Aug 17, 2024 • Updated last year
nlpxucan / WizardLM
LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath
☆9,484 • Jun 7, 2025 • Updated last year
oobabooga / textgen
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,501 • Jun 2, 2026 • Updated last month
chidiwilliams / buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
☆20,546 • Updated this week
deep-floyd / IF
☆7,806 • Apr 14, 2024 • Updated 2 years ago
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,913 • Jul 29, 2024 • Updated 2 years ago
Vision-CAIR / MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,651 • Sep 2, 2024 • Updated last year
WhisperSpeech / WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
☆4,625 • Dec 14, 2025 • Updated 7 months ago
zilliztech / GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
☆8,117 • Jul 11, 2025 • Updated last year
nomic-ai / gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,399 • May 27, 2025 • Updated last year
langchain-ai / langchain
The agent engineering platform.
☆142,699 • Updated this week
databrickslabs / dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
☆10,805 • Jun 30, 2023 • Updated 3 years ago
snakers4 / silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,791 • Jul 16, 2026 • Updated 2 weeks ago
yoheinakajima / babyagi
☆22,340 • Jan 31, 2026 • Updated 5 months ago
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,249 • Sep 30, 2025 • Updated 10 months ago
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆23,009 • Jul 23, 2026 • Updated last week
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,975 • Jun 10, 2024 • Updated 2 years ago
yl4579 / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,322 • Aug 10, 2024 • Updated last year
openai / chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
☆21,191 • Jul 4, 2024 • Updated 2 years ago
neonbjb / tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
☆14,866 • Nov 19, 2024 • Updated last year
guidance-ai / guidance
A guidance language for controlling large language models.
☆21,696 • May 21, 2026 • Updated 2 months ago
Softcatala / whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆1,332 • Feb 14, 2026 • Updated 5 months ago