mbzuai-oryx / TimeTravelLinks
[ACL 2025 π₯] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts
β18Updated last month
Alternatives and similar repositories for TimeTravel
Users that are interested in TimeTravel are comparing it to the libraries listed below
Sorting:
- Official code repository of paper titled "Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Visioβ¦β27Updated 2 months ago
- [ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes πππβ37Updated 5 months ago
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"β24Updated last month
- Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"β25Updated 4 months ago
- [CVPR 2025 π₯]A Large Multimodal Model for Pixel-Level Visual Grounding in Videosβ74Updated 3 months ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ15Updated 3 weeks ago
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite foβ¦β48Updated 10 months ago
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".β12Updated 9 months ago
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]β22Updated 8 months ago
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" β¦β34Updated 2 years ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attentionβ37Updated last year
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillationβ62Updated last week
- Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Modelsβ23Updated 5 months ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"β35Updated last year
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Modelsβ34Updated 5 months ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Claβ¦β48Updated last year
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversionβ49Updated 2 months ago
- Rui Qian, Xin Yin, Dejing Douβ : Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)β38Updated 2 months ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.β16Updated 3 months ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"β29Updated 9 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"β41Updated 7 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.β28Updated last year
- A codeβ15Updated 5 months ago
- Official Implementation of CODEβ15Updated 9 months ago
- Code for "Enhancing In-context Learning via Linear Probe Calibration"β35Updated last year
- Easy wrapper for inserting LoRA layers in CLIP.β34Updated last year
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understandingβ43Updated 6 months ago
- [InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Clβ¦β35Updated 7 months ago
- Composed Video Retrievalβ58Updated last year
- CLIP-MoE: Mixture of Experts for CLIPβ42Updated 9 months ago