facebookresearch / Implicit-HRTFLinks
This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Temporal Convolutional Networks", ICASSP 2021.
☆11Updated last year
Alternatives and similar repositories for Implicit-HRTF
Users that are interested in Implicit-HRTF are comparing it to the libraries listed below
Sorting:
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆24Updated 10 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- Repo for Visual Acoustic Matching, CVPR 2022☆68Updated 2 years ago
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)☆27Updated 3 months ago
- ☆10Updated 3 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆47Updated 11 months ago
- ☆11Updated 11 months ago
- Long-Term Rhythmic Video Soundtracker, ICML2023☆59Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆29Updated 4 months ago
- ☆44Updated 8 months ago
- [ICME 2024] Official Repository for The Paper, PianoBART: Symbolic Piano Music Understanding and Generating with Large-Scale Pre-Training☆19Updated 9 months ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆109Updated 3 years ago
- Audio propagation engine - Meta Reality Labs Research.☆19Updated 2 years ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆42Updated 10 months ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 5 months ago
- ☆51Updated last week
- Streaming Audiotransformers for online Audio tagging☆45Updated last year
- ☆23Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆17Updated 11 months ago
- ☆56Updated 2 years ago
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"☆36Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- ☆115Updated 5 months ago
- Query-conditioned target sound extraction model☆25Updated 3 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated 3 months ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆15Updated 4 years ago