whisper.cpp bindings for python
☆109Aug 24, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-cpp-python
Users that are interested in whisper-cpp-python are comparing it to the libraries listed below
Sorting:
- Python bindings for whisper.cpp☆327Feb 20, 2026Updated 2 weeks ago
- Pybind11 bindings for Whisper.cpp☆343Dec 8, 2024Updated last year
- Python bindings for whisper.cpp☆249Jun 1, 2024Updated last year
- ☆13Aug 7, 2021Updated 4 years ago
- Html article content extractor in Golang.☆12Oct 31, 2022Updated 3 years ago
- ☆26Nov 3, 2025Updated 4 months ago
- LocalAI website, powered by Hugo☆14Nov 22, 2023Updated 2 years ago
- An open source Java implementation to interpret and render Computer Graphics Metafile (CGM) graphics files.☆15Jun 20, 2025Updated 8 months ago
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- Research repository for TouchPose: Hand Pose Prediction, Depth Estimation, and Touch Classification from Capacitive Images. ACM UIST 2021…☆21Dec 14, 2021Updated 4 years ago
- ☆12Aug 15, 2022Updated 3 years ago
- ☆11Sep 5, 2025Updated 6 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 9 months ago
- Minimal user-friendly demo of OpenAI's CLIP for semantic image search☆19Sep 28, 2024Updated last year
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆25Oct 9, 2024Updated last year
- Benchmark results from code generation with LLMs☆17Sep 1, 2023Updated 2 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆21Mar 21, 2022Updated 3 years ago
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Sep 7, 2023Updated 2 years ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆619Feb 17, 2025Updated last year
- Simple diarization model☆53Jun 13, 2025Updated 8 months ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Grammatical Error Correction Based on Language Model(BERT, GPT-2), and Seq2Seq☆18Sep 5, 2019Updated 6 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆539Nov 6, 2023Updated 2 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- A python module to process data for Frame Semantic Parsing☆23Nov 3, 2020Updated 5 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 4 years ago
- simple to use, pretrained/training-less models for speaker diarization☆21Aug 23, 2023Updated 2 years ago
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 2 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆857Nov 16, 2024Updated last year
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Sep 6, 2024Updated last year
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 4 months ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆34Nov 18, 2025Updated 3 months ago