Explaining audio differences using language
☆16Feb 11, 2025Updated last year
Alternatives and similar repositories for ADIFF
Users that are interested in ADIFF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- ☆52Mar 24, 2026Updated 2 weeks ago
- small audio language model for reasoning☆85Dec 4, 2025Updated 4 months ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆34Mar 14, 2025Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆30Oct 25, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 6 months ago
- Tools for the evaluation of audio captioning.☆19May 23, 2020Updated 5 years ago
- Official MATPAC implementation and trained model's weights☆28Sep 23, 2025Updated 6 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆14Nov 25, 2025Updated 4 months ago
- Stanford CS224W: Machine Learning with Graphs (GNN)☆12Sep 6, 2022Updated 3 years ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆52Sep 2, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆97Jun 12, 2025Updated 9 months ago
- ☆12Nov 12, 2024Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆71Mar 22, 2026Updated 2 weeks ago
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- ☆10Jul 27, 2021Updated 4 years ago
- ☆11Mar 23, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆93Dec 8, 2023Updated 2 years ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆26Nov 18, 2025Updated 4 months ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation☆17Jun 2, 2025Updated 10 months ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated last year
- ☆10Sep 25, 2024Updated last year
- ☆13Aug 11, 2018Updated 7 years ago
- https://nvmexplorer.seas.harvard.edu NVMExplorer is a cross-stack design space exploration framework for evaluating and comparing on-chip…☆21Jun 21, 2024Updated last year
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 3D Gaussian Splat Easily Attacked to Cause Harm☆12Aug 5, 2025Updated 8 months ago
- collection with description of super-resolution related papers, repositories, datasets, loss functions and etc.☆11Dec 12, 2023Updated 2 years ago
- ☆18Dec 8, 2024Updated last year
- ☆41Feb 18, 2026Updated last month
- ☆11Dec 28, 2023Updated 2 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- ☆14Jul 24, 2025Updated 8 months ago