openai / whisperLinks
Robust Speech Recognition via Large-Scale Weak Supervision
☆94,315Updated last month
Alternatives and similar repositories for whisper
Users that are interested in whisper are comparing it to the libraries listed below
Sorting:
- Port of OpenAI's Whisper model in C/C++☆46,518Updated this week
- Faster Whisper transcription with CTranslate2☆20,833Updated 2 months ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,078Updated 8 months ago
- LLM inference in C/C++☆94,823Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆20,051Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,516Updated last year
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,456Updated last year
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆41,593Updated last week
- The definitive Web UI for local AI, with powerful features and easy setup.☆46,006Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,392Updated 8 months ago
- Inference code for Llama models☆59,141Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,762Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,265Updated last year
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆17,700Updated last week
- 🦜🔗 The platform for reliable agents.☆126,317Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,703Updated this week
- Stable Diffusion web UI☆160,424Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,426Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,578Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆22,971Updated 10 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆46,841Updated this week
- 🔊 Text-Prompted Generative Audio Model☆38,961Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆102,600Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,745Updated last year
- Universal LLM Deployment Engine with ML Compilation☆22,012Updated this week
- Let us control diffusion models!☆33,621Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,978Updated last year
- Tensor library for machine learning☆13,923Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆69,622Updated this week
- The official Meta Llama 3 GitHub site☆29,240Updated last year