openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆78,434Updated 2 months ago
Alternatives and similar repositories for whisper:
Users that are interested in whisper are comparing it to the libraries listed below
- Port of OpenAI's Whisper model in C/C++☆38,606Updated this week
- Faster Whisper transcription with CTranslate2☆14,882Updated 2 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆14,557Updated 2 weeks ago
- 🔊 Text-Prompted Generative Audio Model☆37,245Updated 7 months ago
- Making large AI models cheaper, faster and more accessible☆40,654Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆36,918Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,169Updated 3 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆38,710Updated 7 months ago
- LLM inference in C/C++☆76,950Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆72,884Updated this week
- 🦜🔗 Build context-aware reasoning applications☆103,849Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆29,881Updated 8 months ago
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆13,957Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,220Updated this week
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆42,957Updated this week
- Inference code for Llama models☆57,912Updated last month
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆55,476Updated 4 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆28,181Updated this week
- SOTA Open Source TTS☆20,165Updated this week
- A latent text-to-image diffusion model☆70,086Updated 9 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆71,950Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.☆133,757Updated this week
- Drag & drop UI to build your customized LLM flow☆36,411Updated this week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,571Updated 11 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,265Updated 7 months ago
- Instruct-tune LLaMA on consumer hardware☆18,842Updated 7 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆21,691Updated last week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆31,167Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆41,803Updated this week
- Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products wi…☆37,569Updated last week