liuxubo717 / LASS
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆135Updated 11 months ago
Related projects:
- Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021☆68Updated 3 years ago
- Visually-Aware Audio Captioning☆41Updated last year
- Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.☆45Updated 2 years ago
- Code for "Simple Pooling Front-ends for Efficient Audio Classification", ICASSP 2023☆54Updated last year
- Audio Captioning datasets for PyTorch.☆98Updated 2 weeks ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆28Updated 3 months ago
- Unofficial PyTorch implementation of SoundStream with training code and a 16kHz pretrained checkpoint☆54Updated last year
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆99Updated 5 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆81Updated last month
- This repository contains metadata for the WavCaps dataset and code for downstream tasks.☆194Updated last month
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆151Updated 2 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆88Updated 3 months ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆146Updated 2 months ago
- Official Implementation of Multi-Stage Multi-Codebook (MSMC) TTS☆159Updated 5 months ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆49Updated last year
- The official source code of UniAudio☆81Updated 5 months ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆39Updated 2 weeks ago
- Audio Codec Speech processing Universal PERformance Benchmark☆201Updated last week
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.☆112Updated 3 weeks ago
- Learning differentiable temporal resolution on time-series data.☆33Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆10Updated 2 months ago
- This package aims at simplifying the download of the AudioCaps dataset.☆29Updated 9 months ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆44Updated 5 months ago
- A library built for easier audio self-supervised training and downstream task evaluation☆92Updated 3 weeks ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆133Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆123Updated 2 months ago
- Code for CVSSP submission to DCASE 2021 Task 6☆35Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆51Updated last year