Explaining audio differences using language
☆16Feb 11, 2025Updated last year
Alternatives and similar repositories for ADIFF
Users that are interested in ADIFF are comparing it to the libraries listed below
Sorting:
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- ☆51Apr 13, 2025Updated 11 months ago
- small audio language model for reasoning☆86Dec 4, 2025Updated 3 months ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆30Oct 25, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- Tools for the evaluation of audio captioning.☆19May 23, 2020Updated 5 years ago
- Official MATPAC implementation and trained model's weights☆27Sep 23, 2025Updated 5 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆15Nov 25, 2025Updated 3 months ago
- Stanford CS224W: Machine Learning with Graphs (GNN)☆11Sep 6, 2022Updated 3 years ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆52Sep 2, 2025Updated 6 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆96Jun 12, 2025Updated 9 months ago
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year
- ☆12Nov 12, 2024Updated last year
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆69Jul 19, 2025Updated 8 months ago
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- ☆10Jul 27, 2021Updated 4 years ago
- ☆10Oct 16, 2025Updated 5 months ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆93Dec 8, 2023Updated 2 years ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated last year
- ☆10Sep 25, 2024Updated last year
- ☆13Aug 11, 2018Updated 7 years ago
- https://nvmexplorer.seas.harvard.edu NVMExplorer is a cross-stack design space exploration framework for evaluating and comparing on-chip…☆21Jun 21, 2024Updated last year
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- collection with description of super-resolution related papers, repositories, datasets, loss functions and etc.☆11Dec 12, 2023Updated 2 years ago
- ☆40Feb 18, 2026Updated last month
- ☆11Dec 28, 2023Updated 2 years ago
- ☆18Dec 8, 2024Updated last year
- This repository collects papers related to Speech Tokenizer.☆17Oct 16, 2024Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- VoiceBench: Benchmarking LLM-Based Voice Assistants☆340Updated this week
- ☆14Jul 24, 2025Updated 7 months ago
- 서강대학교 알고리즘 소학회 Sogang ICPC Team에서 진행하였던 강의 주제들입니다.☆18Jul 18, 2022Updated 3 years ago