linto-ai / linto-desktoptools-hmgLinks
GUI Tool to create, manage and test Keyword Spotting models using TF 2.0
☆13Updated 4 years ago
Alternatives and similar repositories for linto-desktoptools-hmg
Users that are interested in linto-desktoptools-hmg are comparing it to the libraries listed below
Sorting:
- Buildings block for voice-enabled applications in the browser☆37Updated 6 months ago
- On-device streaming text-to-speech engine powered by deep learning☆122Updated 2 months ago
- Wake word detection with custom phrases without model training☆21Updated 2 months ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 5 years ago
- torchlogic is a pytorch framework for developing Neuro-Symbolic AI systems and implements Neural Reasoning Networks.☆13Updated last month
- streaming speech to text server using Whisper☆95Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆46Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆35Updated 7 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- An even smaller speech recognizer / force aligner☆36Updated 10 months ago
- A minimalist hotword / wake word for the web, based on Porcupine☆61Updated 2 months ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learning☆233Updated last month
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆71Updated last year
- Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]☆33Updated 2 weeks ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆62Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 11 months ago
- Speaker diarization service☆24Updated 4 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆71Updated 4 months ago
- ☆50Updated this week
- ☆19Updated 8 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 7 months ago
- Open TTS models, built for streaming on the edge☆44Updated 7 months ago
- Local LLaMAs/Models in VSCode☆54Updated 2 years ago