Exgc/OmniSep

Exgc / OmniSep

Sound Separation, Omni modal

☆29

Alternatives and similar repositories for OmniSep

Users interested in OmniSep are comparing it to the repositories listed below.

NKU-HLT / AudioEditor
☆47 · Apr 2, 2025 · Updated last year
merlresearch / unified-source-separation
Official repo for task-aware unified source separation (TUSS)
☆23 · Jul 31, 2025 · Updated 11 months ago
WingSingFung / TISDiSS
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16 · Oct 19, 2025 · Updated 9 months ago
JusperLee / Look2hear
A toolkit for researchers in multimodal sound separation.
☆16 · Oct 20, 2023 · Updated 2 years ago
Aisaka0v0 / CLAPSep
Query-conditioned target sound extraction model
☆30 · Mar 25, 2025 · Updated last year
Audio-AGI / FlowSep
Official implementation for FlowSep
☆77 · Jan 2, 2025 · Updated last year
JusperLee / TFACM
☆23 · Jul 16, 2025 · Updated last year
Takaaki-Saeki / ssl_speech_restoration_v2
☆17 · Dec 18, 2023 · Updated 2 years ago
WikiChao / DAVIS
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official PyTorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33 · Mar 30, 2026 · Updated 3 months ago
LuluW8071 / Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
☆11 · Jan 23, 2025 · Updated last year
sony / mmaudiosep
☆16 · Apr 30, 2026 · Updated 2 months ago
matthewmcq / upscalemp3_v2
MP3-to-WAV super-resolution model for audio restoration and enhancement. U-Net + Discrete Wavelet Transform (DWT) architecture
☆21 · Dec 1, 2025 · Updated 7 months ago
laitselec / MuFun
☆37 · Aug 31, 2025 · Updated 10 months ago
JHU-LCAP / FlexSED
Open-vocabulary sound event detection
☆53 · Dec 17, 2025 · Updated 7 months ago
gxu82 / MVDR-Speech-Enhancement
☆16 · Jul 14, 2020 · Updated 6 years ago
kaistmm / V2SFlow
[ICASSP 2025] V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow
☆21 · Jun 3, 2025 · Updated last year
TeaPoly / PLCPA-ASYM-Loss
The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss
☆15 · Sep 4, 2023 · Updated 2 years ago
hmartelb / avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20 · Sep 1, 2023 · Updated 2 years ago
yongyizang / TrainingFreeMultiStepASR
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54 · May 26, 2025 · Updated last year
woongzip1 / UniverSR
Official implementation of UniverSR (ICASSP 2026)
☆59 · Apr 9, 2026 · Updated 3 months ago
aleXiehta / Causal-SE
Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"
☆28 · Feb 26, 2023 · Updated 3 years ago
YoungJay0612 / AEC-Summary-Including-Papers-and-Code
A collection of papers, code, blog posts, and related materials on acoustic echo cancellation (AEC)
☆16 · Jul 26, 2022 · Updated 3 years ago
WikiChao / VisAH
[CVPR 2025] PyTorch implementation of the paper "Learning to Highlight Audio by Watching Movies"
☆15 · Oct 1, 2025 · Updated 9 months ago
JusperLee / Swift-Net
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26 · Updated this week
costrice / vminer
Official implementation and project page of the CVPR'24 paper "VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Li…
☆14 · Aug 6, 2024 · Updated last year
yoongi43 / music_source_separation
☆14 · Jan 12, 2023 · Updated 3 years ago
seongq / flowmse
(ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement
☆107 · Jul 23, 2025 · Updated last year
sungwon23 / BSRNN
☆138 · Apr 24, 2023 · Updated 3 years ago
suimuc / MTV_Framework
☆23 · Oct 15, 2025 · Updated 9 months ago
Shybert-AI / AEC-Two-Stage-Based
A Two-Stage-Based Acoustic Echo Cancellation System
☆17 · Feb 22, 2026 · Updated 5 months ago
WangHelin1997 / LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17 · Nov 7, 2024 · Updated last year
urgent-challenge / urgent2025_challenge
Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.
☆85 · May 21, 2025 · Updated last year
liuxubo717 / LASS
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆146 · Oct 11, 2023 · Updated 2 years ago
jasongief / TGS-Agent
[2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
☆20 · Nov 8, 2025 · Updated 8 months ago
cyanbx / Frieren-V2A
Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)
☆62 · Apr 3, 2025 · Updated last year
IiuZiKai / Evo_TSE
☆17 · Apr 9, 2026 · Updated 3 months ago
introlab / uimvdr
☆13 · Oct 11, 2024 · Updated last year
ssi-research / FQSE
Fully Quantized Neural Networks For Speech Enhancement
☆65 · Feb 15, 2024 · Updated 2 years ago
NikolaiKyhne / RWSAMamba-UNet
Official repository for the paper "Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enh…
☆19 · May 5, 2026 · Updated 2 months ago