cirosilvano / easyvad
Simple, energy-based voice activity detection algorithm implementation.
☆17 Updated 11 months ago
Alternatives and similar repositories for easyvad:
Users interested in easyvad are comparing it to the libraries listed below.
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build. ☆27 Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… ☆39 Updated 7 months ago
- A SQLite extension for working with float and binary vectors. Work in progress! ☆20 Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++ ☆30 Updated 9 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml ☆87 Updated last year
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC ☆21 Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages. ☆48 Updated last week
- Distributed Approximate Nearest Neighbors Database https://anndb.com ☆36 Updated 3 years ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visualize and retrieve from image datasets, similar to "CLIP Retriev… ☆45 Updated last year
- Speech-end detection library, based on WebRTC's VAD engine ☆21 Updated 9 months ago
- numpy ufuncs for vector similarity ☆14 Updated last year
- TTS support with GGML ☆25 Updated last month
- gRPC server for hnswlib ☆14 Updated 2 years ago
- Rust crate for some audio utilities ☆22 Updated 2 weeks ago
- Port of Suno AI's Bark in C/C++ for fast inference ☆53 Updated 11 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻 ☆60 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆202 Updated last week
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version… ☆25 Updated 2 years ago
- Voicemail.... for the web! Create voicemails via WebRTC and Transcribe them. ☆36 Updated last week
- ☆29 Updated 3 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime. ☆32 Updated last year
- An example of using pion/opus in WASM to decode audio files - https://sean-der.github.io/wasm-audio-decode/ ☆15 Updated 2 years ago
- First token cutoff sampling inference example ☆29 Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆59 Updated last year
- Coqui Inference Engine ☆38 Updated 3 years ago
- Trying to deconstruct RWKV in understandable terms ☆14 Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation ☆43 Updated last year
- Proof of concept for running moshi/hibiki using webrtc ☆18 Updated last month
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea… ☆14 Updated 3 years ago