openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆90,019 · Updated last month
Alternatives and similar repositories for whisper
Users who are interested in whisper are comparing it to the libraries listed below.
- Faster Whisper transcription with CTranslate2 ☆18,757 · Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆18,297 · Updated last week
- Port of OpenAI's Whisper model in C/C++ ☆44,056 · Updated this week
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. ☆15,506 · Updated last week
- 🔊 Text-Prompted Generative Audio Model ☆38,636 · Updated last year
- 🦜🔗 Build context-aware reasoning applications ☆117,729 · Updated last week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆43,116 · Updated last year
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆154,818 · Updated this week
- A latent text-to-image diffusion model ☆71,672 · Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆22,579 · Updated 7 months ago
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model ☆9,822 · Updated last year
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆40,318 · Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆35,128 · Updated 6 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,637 · Updated last year
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆76,829 · Updated 5 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models ☆41,894 · Updated 4 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,185 · Updated 4 months ago
- A fast, local neural text to speech system ☆10,171 · Updated 2 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆44,895 · Updated this week
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆8,556 · Updated this week
- Open-source search and retrieval database for AI applications. ☆24,045 · Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,681 · Updated 11 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. ☆31,383 · Updated this week
- LLM inference in C/C++ ☆88,212 · Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆16,330 · Updated 3 weeks ago
- CLI platform to experiment with codegen. Precursor to: https://lovable.dev ☆54,955 · Updated 5 months ago
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat… ☆26,100 · Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆40,461 · Updated last week
- The definitive Web UI for local AI, with powerful features and easy setup. ☆45,225 · Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆51,531 · Updated this week