facebookresearch / Implicit-HRTFLinks
This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Temporal Convolutional Networks", ICASSP 2021.
☆12Updated 2 years ago
Alternatives and similar repositories for Implicit-HRTF
Users that are interested in Implicit-HRTF are comparing it to the libraries listed below
Sorting:
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Updated last year
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆61Updated 5 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆40Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Updated 3 years ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆47Updated last year
- Official source codes of airsep☆39Updated last year
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Updated 11 months ago
- ☆123Updated 11 months ago
- Repo for Visual Acoustic Matching, CVPR 2022☆70Updated 2 years ago
- ☆23Updated 2 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆110Updated 3 years ago
- ☆48Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆42Updated 3 months ago
- Source code for the paper 'Audio Captioning Transformer'☆57Updated 3 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆94Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 11 months ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- ☆53Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- ☆20Updated last year
- ☆25Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆51Updated 9 months ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆53Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- ☆47Updated 8 months ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Updated last year