[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆20Feb 25, 2026Updated last week
Alternatives and similar repositories for AVHBench
Users that are interested in AVHBench are comparing it to the libraries listed below
Sorting:
- ☆16Nov 29, 2024Updated last year
- ☆23Aug 26, 2023Updated 2 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- YOLOv8安全帽工作服检测☆12Oct 13, 2023Updated 2 years ago
- 서울시 열섬현상 완화를 위한 녹지 및 바람길 입지 선정☆18Dec 29, 2019Updated 6 years ago
- ☆31Jun 19, 2025Updated 8 months ago
- Fake Face Photos by Photoshop Experts☆12Jan 14, 2019Updated 7 years ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- PoolC 홈페이지 제작 프로젝트☆10Mar 2, 2018Updated 8 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models☆14Nov 10, 2025Updated 3 months ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Example datasets for DeepPoseKit☆10Nov 10, 2019Updated 6 years ago
- ☆13Aug 7, 2025Updated 6 months ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- CVE-Factory☆53Feb 13, 2026Updated 2 weeks ago
- ☆13Apr 19, 2024Updated last year
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- ☆10Mar 30, 2023Updated 2 years ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 4 months ago
- ☆11May 17, 2024Updated last year
- ☆12Jun 2, 2018Updated 7 years ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- ☆10Apr 9, 2019Updated 6 years ago
- Code for the papers: "Stop Throwing Away Discriminators! Re-using Adversaries for Test-Time Training", Valvano et al., DART 2021; and "Re…☆10Jan 20, 2022Updated 4 years ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 8 months ago
- ☆11Sep 1, 2024Updated last year
- Official repo for FunkNN: Neural Interpolation for Functional Generation☆11May 12, 2023Updated 2 years ago
- 关于ER-X汉化测试☆10Mar 8, 2021Updated 4 years ago
- auto ticket reservation program (python)☆12Jan 28, 2020Updated 6 years ago
- ☆10Jul 5, 2024Updated last year
- python实现微博热点事件舆情分析(爬虫)☆12May 5, 2022Updated 3 years ago
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V …☆14Aug 29, 2022Updated 3 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- L2G Auto-encoder: Understanding Point Clouds by Local-to-Global Reconstruction with Hierarchical Self-Attention☆12Feb 29, 2020Updated 6 years ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- Reddit Crawler API for collecting datasets from Reddit.☆11Dec 31, 2022Updated 3 years ago
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.☆11Oct 20, 2020Updated 5 years ago
- tmp DPI☆14Dec 18, 2024Updated last year
- ☆16Sep 29, 2025Updated 5 months ago