Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆36Updated 2 weeks ago
Alternatives and similar repositories for octopus:
Users that are interested in octopus are comparing it to the libraries listed below
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated last year
- On-device noise suppression powered by deep learning☆67Updated 2 weeks ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- proof of concept conversation orchestrator with a speech-language model☆18Updated 4 months ago
- A curated list of awesome voice activity detection☆40Updated 3 months ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- ☆14Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 9 months ago
- On-device voice activity detection (VAD) powered by deep learning☆201Updated last week
- On-device speaker diarization powered by deep learning☆37Updated 3 weeks ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…☆37Updated last year
- A collection of pre-built speech synthesis settings used to convey emotion☆11Updated 5 years ago
- Web app for keyword spotting using TensorflowJS☆70Updated 2 years ago
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆29Updated 3 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Chord conditioning implemented MusicGen☆55Updated 11 months ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆23Updated 6 years ago
- Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis☆40Updated 4 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 4 months ago
- On-device speaker recognition engine powered by deep learning☆32Updated 3 weeks ago
- CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models [NAACL 2025]☆49Updated last week
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆9Updated 2 years ago