jhCOR / EgoOrientBench
The Official Code Repo for EgoOrientBench [CVPR25]
☆14 Updated 2 months ago
Alternatives and similar repositories for EgoOrientBench
Users who are interested in EgoOrientBench are comparing it to the repositories listed below.
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO… ☆99 Updated last year
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding ☆16 Updated 4 months ago
- ☆124 Updated 7 months ago
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official PyTorch Implementation (CVPR'25, Highlight) ☆26 Updated 8 months ago
- ☆36 Updated 7 months ago
- [NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im… ☆116 Updated last year
- ☆37 Updated 8 months ago
- ☆16 Updated 10 months ago
- [CVPR2025] Official code for Lost in Translation Found in Context ☆23 Updated 3 weeks ago
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression ☆20 Updated last year
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod… ☆19 Updated last year
- Official implementation of project Honeybee (CVPR 2024) ☆464 Updated last year
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap ☆12 Updated 7 months ago
- ☆39 Updated 5 months ago
- Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods. ☆35 Updated last year
- ☆19 Updated last year
- SMILE: A Multimodal Dataset for Understanding Laughter ☆13 Updated 2 years ago
- ☆61 Updated 4 months ago
- ☆25 Updated 7 months ago
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024) ☆94 Updated last year
- The Source Code for OmniVideoBench ☆55 Updated 2 months ago
- [EMNLP 2024] Official code repository of paper titled "PALM: Few-Shot Prompt Learning for Audio Language Models" accepted in EMNLP 2024 c… ☆29 Updated last year
- Official Implementation of CODE ☆17 Updated last year
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati… ☆99 Updated last year
- ☆24 Updated 2 years ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs… ☆56 Updated 5 months ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation ☆15 Updated 9 months ago
- Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025) ☆41 Updated 5 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat… ☆47 Updated last year
- This is an official implementation for "Block Selection Method for Using Feature Norm in Out-of-distribution Detection", CVPR 2023. ☆24 Updated last year