facebookresearch / NORD

Code and pre-trained model release for the ICASSP 2023 Paper "NORD NON-MATCHING REFERENCE BASED RELATIVE DEPTH ESTIMATION FROM BINAURAL AUDIO"

☆11

Related projects ⓘ

Alternatives and complementary repositories for NORD

facebookresearch / Implicit-HRTF
This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…
☆11Updated last year
facebookresearch / rlr-audio-propagation
Audio propagation engine - Meta Reality Labs Research.
☆17Updated 2 years ago
shivammehta25 / Diff-TTSG
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
☆37Updated last year
JinhuaLiang / lam4fsl
An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"
☆28Updated last year
tencent-ailab / TriNet
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
☆26Updated last year
XZWY / MSLDM
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆18Updated 2 months ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆27Updated 3 months ago
facebookresearch / novel-view-acoustic-synthesis
Code for Novel View Acoustic Synthesis paper
☆44Updated last year
anton-jeran / MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
☆42Updated 2 months ago
mushanshanshan / ESLTTS
ESLTTS dataset
☆16Updated 5 months ago
Sosdatasets / SoS_Dataset
☆11Updated 4 months ago
apple / ml-nvas3d
☆46Updated 4 months ago
cvlab-columbia / voicecamo
Code for the paper Real-Time Neural Voice Camouflage
☆28Updated 2 years ago
MaxMax2016 / max-vc
singing voice conversion without f0
☆22Updated last year
walker-hyf / GPT-Talker
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆56Updated 3 weeks ago
zeyuxie29 / PicoAudio
☆36Updated 4 months ago
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆16Updated last week
kohei0209 / self-remixing
Official implementation of Self-Remixing
☆11Updated 9 months ago
AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆29Updated last year
Kikyo-16 / airgen
Official source codes of airsep
☆34Updated 7 months ago
slSeanWU / beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆31Updated 10 months ago
seungheondoh / speech-to-music
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17Updated last year
facebookresearch / visual-acoustic-matching
Repo for Visual Acoustic Matching, CVPR 2022
☆65Updated last year
facebookresearch / 6DoF-Auraliser
An auralisation system that takes a head-worn microphone array recordings as input and renders the audio for binaural playback; taking in…
☆30Updated last year
atosystem / SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
☆109Updated last year
JuanFMontesinos / Acappella-YNet
Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21
☆14Updated 2 years ago
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18Updated last year
maxrmorrison / torbi
Viterbi decoding in PyTorch
☆27Updated last month