whisper.cpp bindings for python
☆112Aug 24, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-cpp-python
Users that are interested in whisper-cpp-python are comparing it to the libraries listed below.
- Python bindings for whisper.cpp☆340Jun 4, 2026Updated last week
- Python bindings for whisper.cpp☆248Jun 1, 2024Updated 2 years ago
- Offline srt producer gui with whisper.cpp☆25Dec 31, 2023Updated 2 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- stable-diffusion.cpp bindings for python☆116May 12, 2026Updated last month
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆12Jun 12, 2023Updated 3 years ago
- Python bindings for llama.cpp☆10,388Updated this week
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆546Nov 6, 2023Updated 2 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- ☆10Apr 4, 2023Updated 3 years ago
- ☆27Nov 3, 2025Updated 7 months ago
- ☆11Jul 3, 2022Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆15Mar 15, 2025Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆35Apr 14, 2026Updated 2 months ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX optimization☆14Jun 5, 2025Updated last year
- A book, created together with Erdoğan Eker, from which you can learn Prompt Engineering.☆12Sep 9, 2024Updated last year
- End-to-end example of training, exporting and deploying a fastai model to a native iOS app☆11Mar 2, 2023Updated 3 years ago
- Plug n Play GBNF Compiler for llama.cpp☆32Nov 8, 2023Updated 2 years ago
- ☆11Sep 5, 2025Updated 9 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- Sample code to demonstrate how to implement a simple chat that works on .NET MAUI, Blazor and Blazor Hybrid with SignalR☆10Feb 14, 2023Updated 3 years ago
- Remove the handwriting of WPI Images with inpainting.☆25Feb 13, 2023Updated 3 years ago
- Simple diarization model☆53Jun 13, 2025Updated last year
- pico w powered led matrix with mvg departure information☆11Oct 23, 2024Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆643Mar 9, 2026Updated 3 months ago
- Text Classification model deployment using FastAPI, Streamlit and Docker Compose☆14Feb 12, 2021Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- Social previews generator as a microservice.☆12Apr 9, 2022Updated 4 years ago
- ☆10Feb 1, 2023Updated 3 years ago
- PlayList Animation using MAUI.☆14Nov 1, 2022Updated 3 years ago
- Learning Pytorch☆13Jun 12, 2018Updated 8 years ago
- A repository with a LaTeX template for creating academic papers.☆10Oct 19, 2017Updated 8 years ago
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆864Nov 16, 2024Updated last year