Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆37 Updated 3 months ago
Alternatives and similar repositories for octopus
Users who are interested in octopus are comparing it to the libraries listed below
- On-device noise suppression powered by deep learning ☆73 Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning ☆223 Updated this week
- Web app for keyword spotting using TensorflowJS ☆73 Updated 2 years ago
- 🐍 Coqui's machine learning job scheduler ☆32 Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆128 Updated 9 months ago
- 🐸TTS recipes for different datasets ☆86 Updated 3 years ago
- An even smaller speech recognizer / force aligner ☆35 Updated 7 months ago
- Web App to transcribe memos using Whisper AI. ☆18 Updated 2 years ago
- 🫠 check your data, before you wreck your model ☆16 Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 4 years ago
- On-device speaker diarization powered by deep learning ☆52 Updated 3 weeks ago
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning ☆121 Updated this week
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated 2 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc… ☆31 Updated last year
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices. ☆25 Updated 4 months ago
- ☆128 Updated 9 months ago
- On-device speaker recognition engine powered by deep learning ☆37 Updated this week
- 🐸STT integration examples ☆130 Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser ☆19 Updated last year
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- ☆19 Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated last year
- A simple, but performant framework for mapping speech directly to categories and intents. ☆21 Updated last year
- Building blocks for voice-enabled applications in the browser ☆37 Updated 3 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- A library for real-time voice processing in web browsers ☆226 Updated this week
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 Updated 2 years ago