rhasspy / pymicro-vad
Self-contained voice activity detector
☆26Updated 9 months ago
Alternatives and similar repositories for pymicro-vad
Users that are interested in pymicro-vad are comparing it to the libraries listed below
Sorting:
- 360M model running in the browser on WebGPU☆21Updated 8 months ago
- Neurox control helm chart details☆31Updated 2 weeks ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆19Updated last month
- Documentation for the Krixik Python client.☆38Updated 6 months ago
- 💭 Chat with AI via API☆31Updated 6 months ago
- Control your Roku with hand gestures using Mediapipe and Python☆18Updated 5 months ago
- Generate ideal question-answers for testing RAG☆126Updated 2 months ago
- LocalScore is an open benchmark which helps you understand how well your computer can handle local AI tasks.☆31Updated last month
- SDK and code generator for the OpenBLE spec☆38Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆19Updated 8 months ago
- ☆19Updated last month
- Flowchart-like UI to interconnect LLM's and Huggingface models, and deploy them as a REST API with little to no code.☆71Updated last month
- ☆54Updated 9 months ago
- Golf is a programming language, framework and application server for high-performance web services and web applications, with focus on …☆44Updated this week
- ☆124Updated 10 months ago
- An experimental project to convert HTML websites into a format compatible with large language models (LLMs), enabling seamless website na…☆22Updated 5 months ago
- ☆91Updated this week
- Search a JSON path and get the value fast☆22Updated 3 months ago
- Visual inference exploration & experimentation playground☆92Updated 5 months ago
- ☆10Updated 11 months ago
- A task management system designed for AI development☆18Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆28Updated last month
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.☆57Updated last year
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆189Updated 2 months ago
- ☆19Updated 3 months ago
- Detect whether or not an audio file was generated by NotebookLM☆137Updated 5 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- ☆24Updated 2 years ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆54Updated last year