DCASE2023-Task7-Foley-Sound-Synthesis/dcase2023_task7_baseline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DCASE2023-Task7-Foley-Sound-Synthesis/dcase2023_task7_baseline)

DCASE2023-Task7-Foley-Sound-Synthesis / dcase2023_task7_baseline

☆32

Alternatives and similar repositories for dcase2023_task7_baseline

Users that are interested in dcase2023_task7_baseline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Vaibhavs10 / dcase-2023-workshop
View on GitHub
☆14Sep 20, 2023Updated 2 years ago
KeisukeImoto / RWCPSSD_Onomatopoeia
View on GitHub
RWCP-SSD-Onomatopoeia
☆24Jun 28, 2023Updated 3 years ago
liuxubo717 / sound_generation
View on GitHub
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
☆69Sep 3, 2021Updated 4 years ago
audio-captioning / dcase-2020-baseline
View on GitHub
Audio captioning baseline system for DCASE 2020 challenge.
☆38Aug 22, 2023Updated 2 years ago
nttrd-mdlab / wearable-seld-dataset
View on GitHub
☆10Feb 18, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ktatar / rawaudiovae
View on GitHub
☆12Jun 9, 2025Updated last year
Le-Xiaohuai-speech / SKIP-DPCRN
View on GitHub
☆52Jun 14, 2022Updated 4 years ago
DCASE-REPO / DESED_task
View on GitHub
Domestic environment sound event detection task
☆157Jun 11, 2024Updated 2 years ago
shkim816 / acnn_speaker_recog
View on GitHub
acnn for text-independent speaker recognition
☆10Feb 8, 2022Updated 4 years ago
camenduru / audioldm-colab
View on GitHub
AudioLDM text to audio colab
☆18Nov 6, 2023Updated 2 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
audio-captioning / audio-captioning-papers
View on GitHub
A list of papers about audio captioning
☆78Jul 1, 2022Updated 4 years ago
sungnyun / avsr-temporal-dynamics
View on GitHub
(SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition
☆13Oct 22, 2024Updated last year
YoonjinXD / T-FOLEY
View on GitHub
Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…
☆34May 25, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
v-iashin / SpecVQGAN
View on GitHub
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
☆372Jul 12, 2024Updated 2 years ago
shinying / hugo-PaperMod-academics
View on GitHub
A fast, clean, responsive Hugo theme, now for academics.
☆10May 20, 2026Updated 2 months ago
ddlBoJack / MT4SSL
View on GitHub
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆45Mar 25, 2024Updated 2 years ago
TuZehai / Sheffield_Clarity_CEC1_Entry
View on GitHub
Implementation of Sheffield entry for Clarity enhancement challenge.
☆18Apr 19, 2022Updated 4 years ago
aliutkus / swf
View on GitHub
☆15Mar 30, 2020Updated 6 years ago
zayd / deep-audio-super-resolution
View on GitHub
Deep neural network for audio super-resolution tasks
☆15Sep 6, 2020Updated 5 years ago
Andong-Li-speech / GaGNet
View on GitHub
This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …
☆72Feb 10, 2022Updated 4 years ago
LCAV / pylocus
View on GitHub
Localization package using distance and/or angle measurements
☆16Mar 11, 2022Updated 4 years ago
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
epic-kitchens / epic-sounds-annotations
View on GitHub
Splits for epic-sounds dataset
☆85Aug 2, 2025Updated 11 months ago
ben-hayes / sinusoidal-gradient-descent
View on GitHub
Experiments from the paper "Sinusoidal Frequency Estimation by Gradient Descent"
☆61Mar 8, 2023Updated 3 years ago
Audio-WestlakeU / RCT
View on GitHub
This repo gives the code for the official implementation of RCT.
☆13Jun 28, 2022Updated 4 years ago
tarepan / Scyclone-PyTorch
View on GitHub
Reproduction of "Scyclone" with PyTorch
☆16Jan 6, 2021Updated 5 years ago
ZhongYang2026 / Sandglasset-A-Light-Multi-Granularity-Self-Attentive-Network-For-Time-Domain-Speech-Separation
View on GitHub
Speech Separation
☆21Mar 7, 2024Updated 2 years ago
yangdongchao / Text-to-sound-Synthesis
View on GitHub
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
☆366Aug 3, 2023Updated 2 years ago
yoyolicoris / spectrogram-inversion
View on GitHub
spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
☆51Jun 12, 2025Updated last year
ZhongshuHou / MHA-DPCRN
View on GitHub
We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN
☆24Jul 4, 2022Updated 4 years ago
thomas-mckenzie / srir_interpolation
View on GitHub
☆14Jun 6, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wanganran / HybridBeam
View on GitHub
Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing
☆19Apr 10, 2024Updated 2 years ago
seungheondoh / music_caps_dl
View on GitHub
Unofficial download repository for MusicCaps
☆47Apr 21, 2023Updated 3 years ago
ffxiong / stsubnet
View on GitHub
☆22Oct 17, 2024Updated last year
JishengBai / ICME2024ASC
View on GitHub
baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆18Mar 16, 2024Updated 2 years ago
AgentCooper2002 / EDMSound
View on GitHub
Codebase and project page for EDMSound
☆35Nov 20, 2023Updated 2 years ago
SunnyValleyStudio / Unity-simple-Pick-Up-system
View on GitHub
How to create a pick up and interaction system in Unity 3d
☆12May 5, 2022Updated 4 years ago
mulab-mir / muchomusic
View on GitHub
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
☆46Dec 3, 2024Updated last year