mbrotos/SoundSeg

Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation

☆13

Alternatives and similar repositories for SoundSeg

Users who are interested in SoundSeg are comparing it to the repositories listed below.

griko / vanpy
☆19 · Jul 23, 2025 · Updated 11 months ago
CienProject2014 / OneLevelHero
libGDX-based role-playing game (RPG)
☆12 · Apr 3, 2016 · Updated 10 years ago
saurjya / EnsembleSep
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12 · Nov 7, 2024 · Updated last year
y-chan / hifi-gan-misrnet
Unofficial PyTorch implementation of HiFi-GAN with fast MISR.
☆15 · Mar 21, 2023 · Updated 3 years ago
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 · Apr 10, 2025 · Updated last year
Sara-Ahmed / ASiT
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
☆30 · Mar 10, 2024 · Updated 2 years ago
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24 · Nov 30, 2022 · Updated 3 years ago
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
samsad35 / code-ancogen
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14 · Mar 11, 2025 · Updated last year
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆27 · Jun 11, 2025 · Updated last year
hanshounsu / d3rm
☆14 · Feb 3, 2026 · Updated 5 months ago
Infinity-INF / fast-phasr
Phoneme and duration labeling based on Whisper small
☆11 · Jul 7, 2024 · Updated 2 years ago
KaguraGateway / node-audio-volume-mixer
"node-audio-volume-mixer" is a library that allows you to control volume in Node.js.
☆13 · Mar 27, 2022 · Updated 4 years ago
RayYuki / CodecBench
☆24 · Nov 16, 2025 · Updated 8 months ago
sony / diffiner
☆68 · Aug 16, 2023 · Updated 2 years ago
lourson1091 / audiobertscore
☆15 · Nov 10, 2025 · Updated 8 months ago
holywyvern / mv-pixi-upgrade
A base project that uses PIXI V3 and not V2.
☆13 · Nov 12, 2015 · Updated 10 years ago
AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆35 · Nov 20, 2023 · Updated 2 years ago
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆26 · Aug 28, 2024 · Updated last year
reppy4620 / vocoders
My vocoder experiments
☆31 · Jul 26, 2025 · Updated 11 months ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation
☆38 · Jan 17, 2024 · Updated 2 years ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 · May 17, 2024 · Updated 2 years ago
3loi / NaturalVoices
☆61 · Oct 22, 2025 · Updated 8 months ago
zhai-lw / L3AC
A lightweight audio codec based on a single quantizer
☆35 · Sep 4, 2025 · Updated 10 months ago
aqtq314 / VogenSVS
☆15 · Apr 16, 2026 · Updated 3 months ago
Zyriix / D2O
Official implementation of "Diffusion Models Are Innate One-Step Generators"
☆27 · Jun 25, 2025 · Updated last year
X-LANCE / LSCodec-Inference
Inference code for the Interspeech 2025 paper "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆36 · Oct 23, 2025 · Updated 8 months ago
ttlabtuat / SingLEM
Implementation and pretrained model for the SingLEM paper.
☆15 · Jul 15, 2026 · Updated last week
yi-ding-cs / EEG-PatchFormer
[EMBC-2025] PyTorch implementation of EEG-PatchFormer
☆15 · Apr 9, 2025 · Updated last year
ZhangXinWhut / SimWhisper-Codec
Official code for the paper "Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"
☆37 · Jan 28, 2026 · Updated 5 months ago
about518 / kanColleDbPost
Sample for POSTing to the KanColle statistics database
☆15 · Aug 18, 2015 · Updated 10 years ago
fss1t / CausalStarGANv2-VC
☆22 · Apr 4, 2023 · Updated 3 years ago
ziyunliu4444 / osu2mir
Repository for Osu2MIR: Beat Tracking Dataset Derived From Osu! Data (ISMIR2025 LBD)
☆17 · Oct 6, 2025 · Updated 9 months ago
line / WaveTrainerFit
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16 · Feb 6, 2026 · Updated 5 months ago
ishine / Mutiband-HIFIGAN
Multiband version of HiFi-GAN
☆19 · Nov 6, 2020 · Updated 5 years ago
benchopt / benchmark_bci
Benchmark for Brain Computer Interface methods
☆19 · Feb 1, 2025 · Updated last year
eloimoliner / CQT_pytorch
PyTorch implementation of the invertible CQT based on non-stationary Gabor filters
☆36 · Jul 7, 2026 · Updated 2 weeks ago
pokang-liu / EEG_MWA
Mental Workload Assessment using EEG
☆16 · Jul 13, 2019 · Updated 7 years ago
jamesparsloe / llm.speech
Trying to build an all-in-one speech-text language model, a bit like GPT-4o
☆22 · Jun 1, 2024 · Updated 2 years ago