zszheng147 / Spatial-ASTLinks
π¦ Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
β56Updated 6 months ago
Alternatives and similar repositories for Spatial-AST
Users that are interested in Spatial-AST are comparing it to the libraries listed below
Sorting:
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipelineβ181Updated 8 months ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"β72Updated 4 months ago
- This package aims at simplifying the download of the AudioCaps dataset.β36Updated last year
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generationβ47Updated last month
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)β55Updated last year
- β42Updated 2 years ago
- Audio-FLANβ157Updated 5 months ago
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer