Sreyan88/ReCLAP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sreyan88/ReCLAP)

Sreyan88 / ReCLAP

☆33

Alternatives and similar repositories for ReCLAP

Users that are interested in ReCLAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
XiaoyuBIE1994 / SDCodec
View on GitHub
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48May 16, 2025Updated last year
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
Sreyan88 / CompA
View on GitHub
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23Jul 10, 2024Updated 2 years ago
WangHelin1997 / SoloAudio
View on GitHub
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆119Jan 28, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gzhu06 / Cacophony
View on GitHub
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
☆49Jan 19, 2026Updated 6 months ago
yongyizang / music-source-restoration
View on GitHub
Official Repository for "Music Source Restoration"
☆31Jun 1, 2025Updated last year
HanxunH / AudioMosaic
View on GitHub
[ICML2026] AudioMosaic: Contrastive Masked Audio Representation Learning
☆23May 15, 2026Updated 2 months ago
apple-yinhan / TQ-SED
View on GitHub
☆24Mar 19, 2025Updated last year
adefossez / audio_mod_idessai
View on GitHub
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆13Sep 13, 2024Updated last year
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
kyegomez / Audio-xLSTMs
View on GitHub
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆19Updated this week
FreedomIntelligence / FusionAudio
View on GitHub
Towards Fine-grained Audio Captioning with Multimodal Contextual Cues
☆87Jan 4, 2026Updated 6 months ago
EuiYeonKim / BSMamba2
View on GitHub
Official Implementation of BSMamba2
☆16Nov 28, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
ETH-DISCO / audio-atlas
View on GitHub
☆15Feb 6, 2026Updated 5 months ago
mileskuo42 / AudioMarkBench
View on GitHub
Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking
☆48Aug 23, 2024Updated last year
ETH-DISCO / blap
View on GitHub
Official repo for BLAP: Bootstrapping Language-Audio Pre-training for Music Captioning presented at ICASSP 2025
☆16Nov 18, 2024Updated last year
crlandsc / torch-l1-snr
View on GitHub
Variations of L1 SNR Loss function for training audio source separation machine learning models
☆45May 1, 2026Updated 2 months ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
raraz15 / neural-music-fp
View on GitHub
"Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025
☆38Sep 11, 2025Updated 10 months ago
ta012 / SSLAM
View on GitHub
[ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes
☆79Oct 8, 2025Updated 9 months ago
qiuqiangkong / music_llm
View on GitHub
☆56Jul 13, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
msplabresearch / MSP-Podcast_Challenge_IS2025
View on GitHub
MSP-Podcast Challenge Baseline Code for Interspeech 2025
☆29Dec 4, 2024Updated last year
mulab-mir / muchomusic
View on GitHub
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
☆46Dec 3, 2024Updated last year
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
SWivid / AUV
View on GitHub
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28Oct 11, 2025Updated 9 months ago
SiavashShams / ssamba
View on GitHub
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
☆140Nov 5, 2025Updated 8 months ago
kwatcharasupat / musdb25
View on GitHub
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
☆13Mar 29, 2025Updated last year
fundwotsai2001 / Text-to-Music_control_family
View on GitHub
Containing SOTA methods that follows time-varying conditions for Text-to-Music
☆24Jan 1, 2026Updated 6 months ago
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AmphionTeam / FlexiCodec
View on GitHub
[ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
☆50Jul 1, 2026Updated 3 weeks ago
Audio-AGI / dcase2024_task9_baseline
View on GitHub
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
SarthakYadav / axlstm-official
View on GitHub
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21Sep 7, 2025Updated 10 months ago
haoxiangsnr / llm-tse
View on GitHub
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
☆43Oct 13, 2023Updated 2 years ago
koichi-saito-sony / ismir2024_tutorial_demo
View on GitHub
☆18Nov 8, 2024Updated last year
Fraunhofer-IIS / ODAQ
View on GitHub
A collection of audio signals accompanied by corresponding subjective scores of perceived quality. Everything under permissive licenses.
☆53Feb 24, 2026Updated 5 months ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Updated this week