fmqa/samplecnn-speech-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fmqa/samplecnn-speech-detection)

fmqa / samplecnn-speech-detection

Speech/Music discrimination using SampleCNN

☆18

Alternatives and similar repositories for samplecnn-speech-detection

Users that are interested in samplecnn-speech-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qlemaire22 / speech-music-detection
View on GitHub
Python framework for Speech and Music Detection using Keras.
☆113Mar 24, 2023Updated 3 years ago
MikeMpapa / CNNs-Speech-Music-Discrimination
View on GitHub
A deep learning framework for Speech-Music discrimination of continuous audio streams
☆68Aug 3, 2018Updated 7 years ago
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
jagger2048 / WebRtc_AGC1
View on GitHub
This repository is webrtc agc module demo.
☆12Jan 23, 2019Updated 7 years ago
resemble-ai / normalise
View on GitHub
A module for normalising text.
☆10Nov 6, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
yangyi0818 / DPARNet
View on GitHub
Dual-Path Attention and Recurrent Network for speech separation
☆21Sep 12, 2024Updated last year
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
korenyoni / opus-api
View on GitHub
OPUS (opus.nlpl.eu) Python3 API
☆18Nov 23, 2024Updated last year
AranKomat / adapinp
View on GitHub
Unofficial implementation of Adaptive Input in PyTorch
☆12Feb 22, 2019Updated 7 years ago
CoEDL / elan-helpers
View on GitHub
Tools and scripts for working with ELAN
☆10Aug 4, 2022Updated 3 years ago
ConferencingSpeech / ConferencingSpeech2022
View on GitHub
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
☆45Apr 11, 2022Updated 4 years ago
mondaugen / timbretoolbox
View on GitHub
A toolbox for extracting audio descriptors in MATLAB.
☆12Jul 20, 2016Updated 9 years ago
guokr / TorchCTR
View on GitHub
CTR Prediction on PyTorch
☆14Sep 2, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sandeepnmenon / Scene-Change-Detection
View on GitHub
Detects scene change or cuts in a video file
☆11Oct 23, 2017Updated 8 years ago
lcn-kul / conferencing-speech-2022
View on GitHub
Source code for LCN submission for ConferencingSpeech2022 challenge.
☆14Nov 11, 2023Updated 2 years ago
yuwchen / InQSS
View on GitHub
☆15Oct 6, 2023Updated 2 years ago
satvik-venkatesh / audio-seg-data-synth
View on GitHub
Artificially synthesising data for audio segmentation to improve music-speech detection
☆17Jul 7, 2021Updated 4 years ago
soham97 / PAM
View on GitHub
PAM is a no-reference audio quality metric for audio generation tasks
☆76Jul 19, 2024Updated last year
foamliu / Listen-Attend-Spell-v2
View on GitHub
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆39Jul 25, 2019Updated 6 years ago
alumae / voxlingua107_sb
View on GitHub
VoxLingua107 recipe for SpeechBrain
☆13Jul 3, 2021Updated 5 years ago
diguo2046 / psola
View on GitHub
Python package implementing the TD-PSOLA algorithm for speech processing
☆43Aug 18, 2017Updated 8 years ago
JusperLee / speechbrain-docs-zh-cn
View on GitHub
SpeechBrain中文文档
☆12Mar 20, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
soham97 / ADIFF
View on GitHub
Explaining audio differences using language
☆16Feb 11, 2025Updated last year
datawhalechina / musiclm-universe
View on GitHub
Music Language Model Generation, Optimization, and Practice
☆61Apr 20, 2026Updated 2 months ago
naver / domainshift-prediction
View on GitHub
☆11May 26, 2020Updated 6 years ago
ndb796 / Face-Gender-Classification-PyTorch
View on GitHub
Face Gender Classification Tutorial: PyTorch Implementations
☆12Mar 2, 2021Updated 5 years ago
skerit / cmusphinx
View on GitHub
A git clone of the CMU Sphinx svn repository
☆61Feb 15, 2013Updated 13 years ago
microsoft / NoAudioCaptioning
View on GitHub
Repository for "Training Audio Captioning Models without Audio"
☆10Sep 26, 2023Updated 2 years ago
WikiChao / ZeroSep
View on GitHub
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆59Nov 3, 2025Updated 8 months ago
kenders2000 / distortionDetection
View on GitHub
C++ Program to detect Clipping and other overload based nonlinear distortions in Wav Files
☆35Feb 4, 2022Updated 4 years ago
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
abhinavkashyap / domadapter
View on GitHub
Domain Adaptation and Adapters
☆16Feb 28, 2023Updated 3 years ago
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 4 years ago
Zeqiang-Lai / Prosody_Prediction
View on GitHub
Predict prosody labels for Chinese sentences.
☆42Jul 7, 2022Updated 3 years ago
daanzu / kaldi-fork-active-grammar
View on GitHub
☆10Nov 1, 2025Updated 8 months ago
Ming-er / Audio-Free-P-Tuning
View on GitHub
☆11Dec 28, 2023Updated 2 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
lysanderism / TimeAudio
View on GitHub
The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…
☆30Nov 18, 2025Updated 7 months ago