solarsamuel / pi5_whisper_voice_assistant
This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4
☆24 · Dec 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for pi5_whisper_voice_assistant
Users who are interested in pi5_whisper_voice_assistant are comparing it to the libraries listed below.
- repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commo… ☆19 · May 20, 2025 · Updated 8 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi ☆12 · Sep 30, 2022 · Updated 3 years ago
- ☆19 · Jun 28, 2022 · Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Nov 28, 2021 · Updated 4 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce… ☆26 · Mar 13, 2025 · Updated 11 months ago
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- [2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line ☆32 · Mar 6, 2023 · Updated 2 years ago
- Multispeaker Community Vocoder Model for DiffSinger ☆39 · Aug 11, 2025 · Updated 6 months ago
- A Raspberry Pi 5-based smart facial recognition door access system integrating OpenCV face detection, servo control, and voice feedback m… ☆10 · Oct 28, 2024 · Updated last year
- Transcription and annotation interface for recorded audio or video files ☆52 · Updated this week
- 4K Video Player for Raspberry Pi 5 for standalone installation ☆14 · Nov 5, 2024 · Updated last year
- This is an example of how to implement ruffle in your own website. ☆13 · Oct 15, 2023 · Updated 2 years ago
- AI DJ Mix Generator - a fully automated system that creates a mix from input of songs closely resembling real life djs work. Includes adv… ☆16 · Jul 2, 2025 · Updated 7 months ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Oct 26, 2020 · Updated 5 years ago
- A collection of audio autoencoders, in PyTorch. ☆44 · Mar 7, 2023 · Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Jan 17, 2024 · Updated 2 years ago
- ☆12 · Apr 13, 2024 · Updated last year
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20… ☆15 · Sep 1, 2024 · Updated last year
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- HumanAI github site ☆18 · Updated this week
- Repository containing scripts/helpers for configuring a Raspberry Pi to work with XMOS mic frontend ☆14 · Jul 31, 2023 · Updated 2 years ago
- Whisper finetuning ☆15 · Apr 9, 2025 · Updated 10 months ago
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE… ☆11 · May 5, 2024 · Updated last year
- A Terraform Version Manager written in Go ☆11 · Oct 3, 2023 · Updated 2 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 · Oct 30, 2024 · Updated last year
- Skeleton Vulkan project using SDL / C++ 🔺 a "Hello Triangle" sample code demo plus examples from "Vulkan Tutorial" (iOS/macOS, Windows, … ☆12 · Jul 11, 2024 · Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 · Apr 10, 2025 · Updated 10 months ago
- ☆13 · Oct 9, 2025 · Updated 4 months ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming ☆15 · Aug 20, 2024 · Updated last year
- Raspbot V2 AI Vision Robot Car for Raspberry Pi 5 ☆16 · Sep 10, 2025 · Updated 5 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆14 · Dec 19, 2022 · Updated 3 years ago
- Contains source code for the Udemy lecture "Applied Yocto Project using Raspberry Pi 5" ☆15 · Jan 19, 2025 · Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 · Nov 6, 2024 · Updated last year
- Russian phonetical transcription ☆11 · Nov 19, 2025 · Updated 2 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- ☆20 · Feb 9, 2026 · Updated last week
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 7 months ago
- Small samples for the Procedural Motion package ☆16 · Mar 19, 2025 · Updated 10 months ago
- ☆11 · Nov 7, 2024 · Updated last year