Picovoice / octopusLinks
On-device Speech-to-Index engine powered by deep learning
β37Updated 6 months ago
Alternatives and similar repositories for octopus
Users that are interested in octopus are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learningβ76Updated 3 months ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- π Coqui's machine learning job schedulerβ31Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learningβ233Updated last month
- Web app for keyword spotting using TensorflowJSβ74Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β134Updated last year
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- On-device speaker diarization powered by deep learningβ57Updated 3 months ago
- Joint speech-language model - respond directly to audio!β30Updated last year
- β43Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- A minimalist hotword / wake word for the web, based on Porcupineβ61Updated 2 months ago
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β35Updated 2 years ago
- On-device speaker recognition engine powered by deep learningβ37Updated 3 months ago
- An even smaller speech recognizer / force alignerβ36Updated 10 months ago
- πΉ pyannote + π notebook = pyannotebookβ26Updated 2 years ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Updated 3 years ago
- On-device streaming text-to-speech engine powered by deep learningβ122Updated 2 months ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voiceβ10Updated 5 years ago
- β162Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β68Updated 3 weeks ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- Zero-shot Audio Classification using Whisperβ78Updated 2 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeScβ¦β31Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- Open TTS models, built for streaming on the edgeβ44Updated 7 months ago
- Lyra V2 (SoundStream) running in the browserβ18Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ97Updated last year