Picovoice / octopusLinks
On-device Speech-to-Index engine powered by deep learning
☆37Updated 4 months ago
Alternatives and similar repositories for octopus
Users that are interested in octopus are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆74Updated 3 weeks ago
- An even smaller speech recognizer / force aligner☆35Updated 8 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- Web app for keyword spotting using TensorflowJS☆73Updated 2 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆227Updated 2 weeks ago
- On-device streaming text-to-speech engine powered by deep learning☆120Updated 3 weeks ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- On-device speaker diarization powered by deep learning☆52Updated 3 weeks ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Web App to transcribe memos using Whisper AI.☆18Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 5 years ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Tools to create your own voice dataset for TTS training☆68Updated 4 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆31Updated last year
- ☆43Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆29Updated last week
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆25Updated 6 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages☆33Updated this week
- Manage audio and video datasets☆31Updated 3 weeks ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 3 years ago
- 🐸STT integration examples☆129Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week