SEPIA-Framework / sepia-web-audio
Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, resampling and much more...
☆44Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sepia-web-audio
- SEPIA server to support open-source speech recognition via WebSocket connection.☆121Updated 2 weeks ago
- An even smaller speech recognizer / force aligner☆32Updated last week
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ project☆37Updated 4 years ago
- TTS Client for Coqui TTS server☆13Updated last year
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆41Updated 4 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆25Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- Web app for keyword spotting using TensorflowJS☆69Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Updated 2 years ago
- On-device speaker diarization powered by deep learning☆26Updated this week
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Updated 4 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 3 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆57Updated last year
- Buildings block for voice-enabled applications in the browser☆33Updated last week
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆25Updated 2 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 6 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆18Updated 9 months ago
- Coqui Inference Engine☆38Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆179Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- Lyra V2 WebAssembly build☆29Updated 2 months ago
- Open models for Coqui STT☆122Updated last year
- A library for real-time voice processing in web browsers☆200Updated last month
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆28Updated this week
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆27Updated 5 months ago
- Resample audio in node or browser using a web assembly port of libsamplerate.☆32Updated last week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆33Updated 2 years ago