Silence detection in audio streams using webrtcvad
☆49Dec 9, 2023Updated 2 years ago
Alternatives and similar repositories for rhasspy-silence
Users that are interested in rhasspy-silence are comparing it to the libraries listed below
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- How to create your own model for vosk☆75Aug 14, 2021Updated 4 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- A Text-To-Speech Model Developed Using 🐸STT☆13Jun 22, 2022Updated 3 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion, adapted to support a 44100 Hz Japanese HuBERT.☆16May 21, 2023Updated 2 years ago
- Predictive modeling of users' interpersonal characteristics by the sound of their voices and manner of speaking.☆12Jun 11, 2018Updated 7 years ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆48Sep 15, 2025Updated 6 months ago
- A Fourier-based audio synthesiser written in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated last year
- Singing Voice Synthesis System based on Sinsy☆23Mar 3, 2020Updated 6 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- This is a python implementation of Hierarchical Image Matting Model for Segmentation.☆11Jun 21, 2022Updated 3 years ago
- This is a Javascript toolbox to perform online rating studies with auditory material.☆18Nov 18, 2024Updated last year
- Documentation of the Two!Ears Auditory Model☆13Feb 14, 2019Updated 7 years ago
- Calculates and compares perceptual sound texture statistics☆16Mar 16, 2021Updated 5 years ago
- Speech emotion recognition using LSTM, SVM and MLP☆10Jul 1, 2019Updated 6 years ago
- A version of the SUSTAIN model of category learning (Love, Medin, & Gureckis, 2004) implemented in Python☆20Apr 20, 2016Updated 9 years ago
- ☆14Jan 23, 2026Updated last month
- A set of Python classes implementing several basic turbo-algorithms (e.g. turbo-decoding)☆13Aug 31, 2020Updated 5 years ago
- Define an errata in table format (CSV) and then apply it to an arbitrary source. Inspired by RFC Errata, lets you keep your own errata in…☆21Aug 24, 2015Updated 10 years ago
- Another example about how to use python sockets, pyaudio and opencv to create a video audio streaming service N:1 (Client:Server)☆15Aug 21, 2016Updated 9 years ago
- Code and data documenting this paper: "Distinct Cortical Pathways for Music and Speech Revealed by Hypothesis-Free Voxel Decomposition". …☆12Jul 18, 2022Updated 3 years ago
- Pytorch Text GAN for lyrics generation☆10Apr 13, 2019Updated 6 years ago
- An R Package for Discovering Rhythmicity in Biological Data with an Interactive Web Interface☆12Mar 7, 2022Updated 4 years ago
- Oscillator-based speech syllabification algorithm☆11Sep 27, 2019Updated 6 years ago
- Sentiment Analysis using logistic regression☆16Apr 19, 2014Updated 11 years ago
- Programming in Psychological Science course. This repository contains materials for a R + Python intro course.☆16May 3, 2023Updated 2 years ago
- A collection of tools for the analysis of biological data☆12Mar 22, 2016Updated 9 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Jan 18, 2026Updated 2 months ago
- Train a fiwGAN or ciwGAN model using your own training data☆14Oct 13, 2022Updated 3 years ago
- For our Smart Media Player project (detecting the time periods in audio/video during which specific people are speaking)☆18Feb 25, 2020Updated 6 years ago
- Headphone-use screening test developed by Chait lab (UCL). The JS version is implemented by Sijia Zhao.☆15Apr 6, 2022Updated 3 years ago
- Deep Learning for HAR: models and tools for Human Activity Recognition from IMU sensor (accelerometer, gyroscope) data☆10Sep 16, 2020Updated 5 years ago
- fastACI toolbox: the MATLAB toolbox for investigating auditory perception using reverse correlation.☆15Dec 29, 2025Updated 2 months ago
- Gem to allow access to your named routes from Coffeescript☆14Jun 17, 2015Updated 10 years ago
- Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]☆14Oct 22, 2024Updated last year