l3das/L3DAS23

Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge

☆16

Alternatives and similar repositories for L3DAS23

Users who are interested in L3DAS23 are comparing it to the repositories listed below.

ispamm / HI2I
Official PyTorch repository for Hypercomplex Image-to-Image Translation
☆18 · Jan 23, 2023 · Updated 3 years ago
ispamm / DM4ASC
Diffusion Models for Audio Semantic Communication
☆18 · Apr 17, 2024 · Updated 2 years ago
LuigiSigillo / StawGAN
Official PyTorch repository for StawGAN: Structural-Aware Generative Adversarial Networks for Infrared Image Translation
☆19 · Oct 18, 2023 · Updated 2 years ago
ispamm / TRIANGLE
☆18 · Apr 24, 2026 · Updated 3 months ago
XiangzhuKong / CA-Dense-UNet
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13 · Jul 17, 2023 · Updated 3 years ago
HS-YN / PanoAVQA
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16 · Oct 12, 2021 · Updated 4 years ago
TaoRuijie / SEANet
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32 · Feb 28, 2025 · Updated last year
sony / audio-visual-seld-dcase2023
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68 · Mar 19, 2025 · Updated last year
thomeou / General-network-architecture-for-sound-event-localization-and-detection
This repository consists of Python code to train sound event localization and detection models.
☆22 · Jan 21, 2021 · Updated 5 years ago
lin9x / AV-Sepformer
☆65 · Jun 28, 2023 · Updated 3 years ago
echocatzh / GTCNN
Personalized AEC
☆19 · Nov 3, 2022 · Updated 3 years ago
l3das / L3DAS21
☆37 · Jun 22, 2022 · Updated 4 years ago
ASLP-lab / FMSU-Bench
Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model
☆25 · May 21, 2026 · Updated 2 months ago
ispamm / FolAI
Stable-V2A: Synthesis of Synchronized Sound Effect with Temporal and Semantic Controls
☆18 · May 27, 2025 · Updated last year
sveinnpalsson / semivaebrats
Semi-supervised variational autoencoder for survival prediction
☆15 · Nov 29, 2023 · Updated 2 years ago
ispamm / Img2Img-SC
☆19 · Jun 16, 2024 · Updated 2 years ago
seorim0 / DNN-based-Speech-Enhancement-in-the-frequency-domain
DNN-based SE in the frequency domain using PyTorch. You can test some state-of-the-art networks using T-F masking or spectral mapping met…
☆61 · Apr 2, 2022 · Updated 4 years ago
marl / SpatialScaper
☆75 · Aug 7, 2025 · Updated 11 months ago
mcomunita / syncfusion
SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis
☆19 · Jul 22, 2024 · Updated 2 years ago
e13000 / directional_sparse_filtering
Directional sparse filtering for blind speech separation
☆11 · Jun 8, 2021 · Updated 5 years ago
zexupan / MuSE
☆42 · Nov 22, 2024 · Updated last year
cogmhear / Intelligibility-Oriented-Audio-Visual-Speech-Enhancement
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
☆15 · Sep 6, 2024 · Updated last year
qiuqiangkong / sampleRNN_acoustic_scene_generation
☆14 · Apr 18, 2019 · Updated 7 years ago
naver-ai / rewas
Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"
☆44 · Dec 13, 2024 · Updated last year
zoezou2015 / abs_pretraining
☆10 · Apr 28, 2021 · Updated 5 years ago
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
Code for the paper "Audio Visual Speaker Localization from EgoCentric Views"
☆11 · Jul 3, 2024 · Updated 2 years ago
limuhit / pseudocylindrical_convolution
Pseudocylindrical convolutions for Learned Omnidirectional Image Compression
☆13 · Jan 16, 2026 · Updated 6 months ago
Gitxiaoke / SNnet
Network source: Interactive Speech and Noise Modeling for Speech Enhancement
☆28 · Jan 10, 2022 · Updated 4 years ago
danielkrause / DCASE2022-data-generator
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47 · Apr 5, 2023 · Updated 3 years ago
manishmalik / Voice-Classification
Gender Classification from voice
☆10 · Apr 27, 2015 · Updated 11 years ago
rsykoss / ntu-automate-star-wars
☆10 · Dec 8, 2022 · Updated 3 years ago
sharathadavanne / seld-dcase2023
Baseline method for sound event localization task of DCASE 2023 challenge
☆71 · Mar 13, 2023 · Updated 3 years ago
TParcollet / Quaternion-Neural-Networks
This is the updated version of https://github.com/Orkis-Research/Pytorch-Quaternion-Neural-Networks
☆21 · Mar 5, 2020 · Updated 6 years ago
genzen2103 / Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
System for emotion detection in given speech data using joint modelling of hand-crafted prosody-rich features, MFCC features and LSTM ba…
☆10 · Nov 15, 2017 · Updated 8 years ago
Qinwen-Hu / dparn
☆74 · Sep 6, 2022 · Updated 3 years ago
ashispati / dmelodies_controllability
Code for running experiments in our ISMIR'21 paper titled: "Is Disentanglement enough? On Latent Representations for Controllable Music G…
☆12 · Aug 7, 2021 · Updated 4 years ago
roger-tseng / av-superb
A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)
☆58 · Apr 17, 2024 · Updated 2 years ago
Andong-Li-speech / EaBNet
This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…
☆107 · Jun 10, 2022 · Updated 4 years ago
rgrzeszi / bof-aed
Bag-of-Features Acoustic Event Detection
☆14 · Oct 5, 2016 · Updated 9 years ago