[ACL 2025 π₯] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts
β19May 22, 2025Updated last year
Alternatives and similar repositories for TimeTravel
Users that are interested in TimeTravel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmarkβ17May 25, 2025Updated last year
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ23May 7, 2026Updated 2 weeks ago
- [NAACL 2025 π₯] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.β38Apr 17, 2025Updated last year
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understandingβ¦β54Mar 13, 2025Updated last year
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Claβ¦β47Sep 28, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [BMVC 2025] Official Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"β31Dec 18, 2025Updated 5 months ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathologyβ12Jun 17, 2025Updated 11 months ago
- [CVPR 2025 π₯] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses theβ¦β46May 26, 2025Updated last year
- Bilingual Medical Mixture of Experts LLMβ32Nov 23, 2024Updated last year
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]β22Oct 27, 2024Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".β12Oct 11, 2024Updated last year
- Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Trackingβ33Mar 14, 2025Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Modelsβ15Nov 1, 2024Updated last year
- [InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Clβ¦β39Dec 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite foβ¦β50Aug 23, 2024Updated last year
- [ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"β10Aug 2, 2024Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabiβ¦β79Sep 24, 2024Updated last year
- Official code repository of paper titled "Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Visioβ¦β35May 11, 2025Updated last year
- Composed Video Retrievalβ62May 2, 2024Updated 2 years ago
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"β¦β52Nov 14, 2023Updated 2 years ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image β¦β144May 11, 2026Updated 2 weeks ago
- QT-DOG: QUANTIZATION-AWARE TRAINING FOR DOMAIN GENERALIZATIONβ25Nov 30, 2025Updated 5 months ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Modelsβ29Oct 20, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for "Enhancing In-context Learning via Linear Probe Calibration"β37Apr 24, 2024Updated 2 years ago
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"β26Jun 8, 2025Updated 11 months ago
- β70Jul 2, 2025Updated 10 months ago
- [IEEE TMI 2025] MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retentionβ19Dec 15, 2025Updated 5 months ago
- [CVPR 2025 π₯]A Large Multimodal Model for Pixel-Level Visual Grounding in Videosβ103Apr 14, 2025Updated last year
- An Open-source Factuality Evaluation Demo for LLMsβ32Feb 23, 2026Updated 3 months ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"β13Aug 6, 2024Updated last year
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detectionβ31Jun 21, 2023Updated 2 years ago
- β11Oct 29, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β21Jul 3, 2025Updated 10 months ago
- β19Sep 18, 2025Updated 8 months ago
- Code and dataset release for the paper "Unstructured Evidence Attribution for Long Context Query Focused Summarization"β11Nov 3, 2025Updated 6 months ago
- Reproducible analyses for the NicheCompass manuscriptβ14Jul 3, 2025Updated 10 months ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Studyβ16Nov 22, 2024Updated last year
- [CVPR 2024 π₯] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses thaβ¦β957Aug 5, 2025Updated 9 months ago
- Source code for MICCAI 2022 paper entitled: 'Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification'β36Jan 13, 2023Updated 3 years ago