Picovoice / octopusLinks
On-device Speech-to-Index engine powered by deep learning
☆37Updated 8 months ago
Alternatives and similar repositories for octopus
Users that are interested in octopus are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆78Updated last week
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- An even smaller speech recognizer / force aligner☆37Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆25Updated 7 years ago
- On-device voice activity detection (VAD) powered by deep learning☆241Updated last week
- On-device speaker diarization powered by deep learning☆61Updated last week
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆43Updated 2 months ago
- Web App to transcribe memos using Whisper AI.☆18Updated 3 years ago
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆36Updated last year
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆41Updated 6 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 3 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 3 years ago
- On-device streaming text-to-speech engine powered by deep learning☆126Updated last week
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated 2 years ago
- gentle forced aligner☆11Updated last year
- Buildings block for voice-enabled applications in the browser☆37Updated 8 months ago
- ☆14Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- ☆26Updated 3 years ago
- 🐸STT integration examples☆129Updated 3 years ago
- Manage audio and video datasets☆33Updated this week
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 6 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Pre-trained model and script to automatically align lyrics to polyphonic audio☆115Updated 5 years ago