SEPIA-Framework / sepia-web-audio
Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, resampling and much more...
☆47Updated last year
Alternatives and similar repositories for sepia-web-audio:
Users that are interested in sepia-web-audio are comparing it to the libraries listed below
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 2 months ago
- An even smaller speech recognizer / force aligner☆32Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆190Updated this week
- Buildings block for voice-enabled applications in the browser☆34Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ project☆40Updated 4 years ago
- Web app for keyword spotting using TensorflowJS☆69Updated 2 years ago
- Resample audio in node or browser using a web assembly port of libsamplerate.☆38Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- Voice activity detection in Javascript☆141Updated 9 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆28Updated 5 months ago
- Open models for Coqui STT☆127Updated last year
- Web Browser Audio Detection/Speech Recording Events API☆72Updated 2 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆27Updated 3 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 6 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆19Updated 11 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 5 months ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆42Updated 5 years ago
- On-device speaker diarization powered by deep learning☆33Updated this week
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- ☆34Updated 9 months ago
- Voice activation detection library for NodeJS☆54Updated 5 years ago
- A library for real-time voice processing in web browsers☆207Updated 3 months ago
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆27Updated 7 months ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Updated 2 years ago