A comprehensive framework to test audio comprehension of Large Audio Language Models.
☆65Jun 3, 2026Updated last week
Alternatives and similar repositories for AU-Harness
Users that are interested in AU-Harness are comparing it to the libraries listed below.
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆18Aug 1, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Universal differential equations for ecologists☆15Apr 24, 2026Updated last month
- ☆25Aug 29, 2025Updated 9 months ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated 3 months ago
- Feel the Vibes☆13Feb 26, 2025Updated last year
- This repository will contain links to the most famous available books of ML that are online☆13Oct 15, 2024Updated last year
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆15Dec 3, 2021Updated 4 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- Official code for SongEcho☆64Mar 3, 2026Updated 3 months ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆18Dec 20, 2022Updated 3 years ago
- A collection of commonly used gen-z slang with description☆10Jun 26, 2023Updated 2 years ago
- ☆10Feb 18, 2022Updated 4 years ago
- Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)☆17Jan 18, 2024Updated 2 years ago
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆31Sep 29, 2025Updated 8 months ago
- ☆11Jun 11, 2025Updated last year
- A simple Python script to convert FOA audio to binaural.☆16Nov 29, 2022Updated 3 years ago
- Pythonic file-system interface for TOS (Tinder Object Storage) https://tosfs.readthedocs.io/en/latest/ ☆17Mar 27, 2026Updated 2 months ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection.☆12Nov 12, 2022Updated 3 years ago
- LinkedIn Job Applicant Scraper: A Python-based web scraper using Selenium to extract applicant information from LinkedIn profiles, facili…☆13Feb 23, 2024Updated 2 years ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 9 months ago
- Open SingSong - Implementation of 'SingSong: Generating Musical Accompaniments from Singing' by Google Research, with a few modifications☆16Jun 10, 2024Updated 2 years ago
- Deprecated Browserbase Python SDK☆10Nov 1, 2024Updated last year
- HippoMM: Hippocampal-inspired Multimodal Memory☆22May 22, 2025Updated last year
- Explanation of the llama2 repo.☆13Jul 18, 2024Updated last year
- Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)☆14Mar 21, 2025Updated last year
- simple trainer for musicgen/audiocraft☆15Jul 14, 2023Updated 2 years ago
- LiveSecBench: a dynamic safety leaderboard for Chinese large language models☆28Mar 9, 2026Updated 3 months ago
- Official implementation of "sound distance estimation" WASPAA 23☆20Dec 31, 2023Updated 2 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆26Jan 12, 2025Updated last year
- ☆52Mar 31, 2026Updated 2 months ago
- [TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistants☆369Updated this week
- prototyping stuff☆14Aug 17, 2025Updated 9 months ago
- Tutorial for Brats 2024 BraSyn (Missing Modality Synthesis) Challenge☆20Sep 30, 2024Updated last year
- Official repository for "3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Reg…☆16Jun 14, 2024Updated 2 years ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆42Apr 19, 2026Updated last month
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆27May 16, 2026Updated 3 weeks ago
- Unveiling the Knowledge of Hindu Scriptures☆13Mar 31, 2025Updated last year