sanchit-gandhi / whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
☆4,355Updated 5 months ago
Related projects: ⓘ
- Faster Whisper transcription with CTranslate2☆11,378Updated 3 weeks ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆11,412Updated 3 weeks ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,495Updated 2 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆10,755Updated last month
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆4,714Updated last month
- ☆7,273Updated 3 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,194Updated last month
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆3,315Updated 2 weeks ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,553Updated 7 months ago
- An Open Source text-to-speech system built by inverting Whisper.☆3,772Updated 3 months ago
- Fast inference engine for Transformer models☆3,218Updated this week
- Real time transcription with OpenAI Whisper.☆2,260Updated 3 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,079Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMs☆9,906Updated 3 months ago
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆9,973Updated 2 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆5,919Updated last week
- Home of StarCoder: fine-tuning & inference!☆7,257Updated 6 months ago
- Python bindings for llama.cpp☆7,723Updated this week
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,055Updated 3 months ago
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,217Updated this week
- Open source codebase powering the HuggingChat app☆7,202Updated this week
- ImageBind One Embedding Space to Bind Them All☆8,221Updated last month
- Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.☆9,731Updated last week
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,931Updated last year
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,339Updated last year
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,085Updated last week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,200Updated this week
- StableLM: Stability AI Language Models☆15,842Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆1,770Updated 2 weeks ago
- Large Language Model Text Generation Inference☆8,762Updated this week