linto-ai / linto-desktoptools-hmg
GUI Tool to create, manage and test Keyword Spotting models using TF 2.0
☆13Updated 4 years ago
Alternatives and similar repositories for linto-desktoptools-hmg
Users that are interested in linto-desktoptools-hmg are comparing it to the libraries listed below
- On-device streaming text-to-speech engine powered by deep learning☆121Updated 2 weeks ago
- Building blocks for voice-enabled applications in the browser☆37Updated 7 months ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 5 years ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆45Updated 2 weeks ago
- Wake word detection with custom phrases without model training☆22Updated 3 months ago
- An even smaller speech recognizer / force aligner☆36Updated 11 months ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆34Updated 3 months ago
- whisper.cpp bindings for Python☆107Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- ☆51Updated last week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 11 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆233Updated 2 weeks ago
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- Simple, fast, parallel Hugging Face GGML model downloader written in Python☆24Updated 2 years ago
- Cross-platform audio recorder designed for real-time speech audio processing☆124Updated 3 months ago
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files.☆51Updated last year
- On-device noise suppression powered by deep learning☆77Updated last week
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 5 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆26Updated 3 months ago
- A simple TTS server for generating speech using StyleTTS2☆37Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 4 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- ☆25Updated 10 months ago