Explaining audio differences using language
☆16 · Feb 11, 2025 · Updated last year
Alternatives and similar repositories for ADIFF
Users that are interested in ADIFF are comparing it to the libraries listed below
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems ☆13 · Jan 16, 2025 · Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆16 · Jun 23, 2024 · Updated last year
- ☆50 · Apr 13, 2025 · Updated 10 months ago
- Official MATPAC implementation and trained model's weights ☆26 · Sep 23, 2025 · Updated 5 months ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023]. ☆20 · Sep 19, 2024 · Updated last year
- official implementation of MGA-CLAP (ACM MM 2024) ☆30 · Oct 25, 2024 · Updated last year
- Actually released! ☆10 · Feb 24, 2021 · Updated 5 years ago
- small audio language model for reasoning ☆86 · Dec 4, 2025 · Updated 2 months ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024 ☆32 · Mar 14, 2025 · Updated 11 months ago
- Codebase and project page for EDMSound ☆35 · Nov 20, 2023 · Updated 2 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆75 · Aug 24, 2024 · Updated last year
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation ☆39 · Nov 20, 2024 · Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆95 · Jun 12, 2025 · Updated 8 months ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models ☆50 · Sep 2, 2025 · Updated 5 months ago
- A working FE Bypass for all Roblox clients ☆19 · Jan 10, 2026 · Updated last month
- Desktop client for Walltaker powered by golang ☆12 · Sep 13, 2022 · Updated 3 years ago
- collection with description of super-resolution related papers, repositories, datasets, loss functions, etc. ☆11 · Dec 12, 2023 · Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac… ☆34 · May 25, 2024 · Updated last year
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies". ☆93 · Dec 8, 2023 · Updated 2 years ago
- ☆40 · Feb 18, 2026 · Updated last week
- This is for Meridian (Traditional Chinese Medicine concept) prediction by machine learning method. ☆11 · Sep 30, 2019 · Updated 6 years ago
- ☆13 · Apr 14, 2025 · Updated 10 months ago
- A rewrite of Open Hexagon ☆12 · Feb 21, 2026 · Updated last week
- core shell functions building blocks for advanced AI pipelines ☆15 · May 2, 2023 · Updated 2 years ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation ☆45 · Mar 27, 2024 · Updated last year
- [CVPR 2025] Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation ☆19 · Dec 18, 2025 · Updated 2 months ago
- Audiosurf 1 server replacement written in TypeScript using node.js, Fastify and Prisma. ☆17 · Jan 23, 2026 · Updated last month
- MOVED TO ☆10 · Oct 29, 2018 · Updated 7 years ago
- ☆12 · Nov 12, 2024 · Updated last year
- Vim environment for authoring, compiling, and debugging Inform7 based interactive fiction works. ☆11 · Aug 22, 2020 · Updated 5 years ago
- Bad Dragon 3D Model Downloader is a command-line utility that facilitates the downloading of 3D models, along with their respective textu… ☆11 · May 18, 2023 · Updated 2 years ago
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models" ☆16 · Mar 11, 2025 · Updated 11 months ago
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR ☆14 · Dec 11, 2024 · Updated last year
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency … ☆10 · Jul 18, 2023 · Updated 2 years ago
- Natural language programming language ☆11 · Aug 28, 2020 · Updated 5 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning" ☆14 · Nov 25, 2025 · Updated 3 months ago
- Webapp that interfaces with 3DS Capture units to allow playback on any device ☆13 · Sep 8, 2020 · Updated 5 years ago
- Updated version of the Ollivander's plugin for Spigot ☆11 · Updated this week
- Supervised and unsupervised Concept-based explanation of pretrained music classifiers ☆12 · Jul 27, 2023 · Updated 2 years ago