Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆36Updated this week
Alternatives and similar repositories for octopus:
Users that are interested in octopus are comparing it to the libraries listed below
- On-device noise suppression powered by deep learning☆66Updated last week
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- On-device speaker diarization powered by deep learning☆38Updated last week
- On-device voice activity detection (VAD) powered by deep learning☆198Updated this week
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated last year
- Web app for keyword spotting using TensorflowJS☆69Updated 2 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…☆36Updated last year
- Web App to transcribe memos using Whisper AI.☆18Updated 2 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆30Updated 9 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆28Updated 3 months ago
- proof of concept conversation orchestrator with a speech-language model☆17Updated 4 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Speaker Diarization with Transformers☆64Updated 8 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated last month
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- A collection of pre-built speech synthesis settings used to convey emotion☆11Updated 5 years ago
- ☆21Updated last month
- A very basic demonstration connecting speech recognition and text-to-speech☆19Updated 4 years ago
- A python package for whisper normalizer☆47Updated 2 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 3 months ago
- A JAX library for building lattice-based speech transducer models☆43Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆70Updated this week
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year