Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆36 Updated last month
Alternatives and similar repositories for octopus:
Users who are interested in octopus are comparing it to the libraries listed below.
- On-device noise suppression powered by deep learning ☆69 Updated this week
- On-device voice activity detection (VAD) powered by deep learning ☆206 Updated last week
- pyannote + notebook = pyannotebook ☆26 Updated last year
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 4 years ago
- On-device speaker diarization powered by deep learning ☆43 Updated last month
- Lyra V2 (SoundStream) running in the browser ☆19 Updated last year
- An even smaller speech recognizer / force aligner ☆32 Updated 4 months ago
- A collection of pre-built speech synthesis settings used to convey emotion ☆11 Updated 5 years ago
- Web app for keyword spotting using TensorflowJS ☆71 Updated 2 years ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 11 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆125 Updated 5 months ago
- [DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume… ☆15 Updated last year
- Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis ☆41 Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated 2 years ago
- ☆14 Updated last year
- Open TTS models, built for streaming on the edge ☆39 Updated last month
- Tracking the state of the art and recent results (bibliography) on sound tasks. ☆32 Updated 2 years ago
- gentle forced aligner ☆11 Updated 11 months ago
- A repo with scripts to test and play around with Facebook's recent llama models! ☆28 Updated last year
- Web App to transcribe memos using Whisper AI. ☆18 Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated last week
- Building blocks for voice-enabled applications in the browser ☆37 Updated 2 months ago
- check your data, before you wreck your model ☆16 Updated 2 years ago
- proof of concept conversation orchestrator with a speech-language model ☆19 Updated 5 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆21 Updated 3 months ago
- Zero-shot Audio Classification using Whisper ☆80 Updated 2 years ago
- A curated list of awesome voice activity detection ☆48 Updated 4 months ago