Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆36 Updated last month
Alternatives and similar repositories for octopus
Users who are interested in octopus are comparing it to the repositories listed below
- On-device noise suppression powered by deep learning ☆70 Updated 3 weeks ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆216 Updated 3 weeks ago
- [DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume… ☆14 Updated last year
- A curated list of awesome voice activity detection ☆54 Updated 6 months ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- On-device speaker diarization powered by deep learning ☆46 Updated 3 weeks ago
- 🐍 Coqui's machine learning job scheduler ☆32 Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated 2 years ago
- 🫠 check your data, before you wreck your model ☆16 Updated 2 years ago
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆127 Updated 6 months ago
- Coqui Inference Engine ☆40 Updated 3 years ago
- TensorFlow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis ☆41 Updated 4 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆21 Updated 4 months ago
- An even smaller speech recognizer / force aligner ☆33 Updated 5 months ago
- ☆14 Updated 2 years ago
- Facebook AI Research Automatic Speech Recognition Toolkit ☆23 Updated 4 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated last year
- 🐸TTS recipes for different datasets ☆87 Updated 2 years ago
- Web app for keyword spotting using TensorFlow.js ☆71 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Manage audio and video datasets ☆30 Updated last week
- Easily turn large sets of audio URLs into an audio dataset. ☆21 Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 Updated 2 years ago
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- A collection of pre-built speech synthesis settings used to convey emotion ☆11 Updated 5 years ago
- Tracking the state of the art and recent results (bibliography) on sound tasks. ☆32 Updated 2 years ago
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we… ☆29 Updated 6 months ago
- Automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated 2 years ago