Sreyan88 / GAMA
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
☆119 Updated 3 months ago
Alternatives and similar repositories for GAMA:
Users who are interested in GAMA are comparing it to the libraries listed below:
- AudioBench: A Universal Benchmark for Audio Large Language Models ☆176 Updated this week
- Audio Captioning datasets for PyTorch. ☆115 Updated 2 weeks ago
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline ☆123 Updated 3 months ago
- Official Implementation of EnCLAP (ICASSP 2024) ☆91 Updated 10 months ago
- Versatile Evaluation of Speech and Audio ☆176 Updated this week
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆150 Updated 6 months ago
- ☆54 Updated last week
- Audio-FLAN ☆140 Updated 3 weeks ago
- The open source code for LLM-Codec ☆132 Updated 7 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space. ☆194 Updated 3 weeks ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model ☆187 Updated this week
- UTokyo-SaruLab MOS Prediction System ☆165 Updated last month
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'