Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆36Updated this week
Alternatives and similar repositories for octopus:
Users that are interested in octopus are comparing it to the libraries listed below
- On-device speaker diarization powered by deep learning☆39Updated this week
- On-device voice activity detection (VAD) powered by deep learning☆202Updated this week
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- On-device noise suppression powered by deep learning☆68Updated this week
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated last year
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- A curated list of awesome voice activity detection☆43Updated 4 months ago
- Web app for keyword spotting using TensorflowJS☆70Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 3 months ago
- Joint speech-language model - respond directly to audio!☆30Updated 10 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- proof of concept conversation orchestrator with a speech-language model☆19Updated 5 months ago
- Buildings block for voice-enabled applications in the browser☆36Updated last month
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated last year
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗☆28Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…☆37Updated last year
- Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis☆40Updated 4 years ago
- Manage audio and video datasets☆28Updated 2 weeks ago
- On-device speaker recognition engine powered by deep learning☆32Updated this week
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆28Updated 8 months ago