Code release for "VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment" [TMLR, 2023]
☆10 · Dec 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for VoLTA
Users that are interested in VoLTA are comparing it to the libraries listed below
- CVPR2026 ☆25 · Sep 18, 2025 · Updated 6 months ago
- ☆32 · Oct 30, 2023 · Updated 2 years ago
- A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating… ☆137 · Mar 20, 2024 · Updated 2 years ago
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection ☆34 · Jul 23, 2025 · Updated 8 months ago
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data ☆28 · Jan 25, 2026 · Updated last month
- A python implement for Certifiable Robust Multi-modal Training ☆19 · Jun 21, 2025 · Updated 9 months ago
- [ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping ☆11 · Feb 7, 2025 · Updated last year
- A framework for Longitudinal Radiology Report Generation ☆26 · Aug 10, 2024 · Updated last year
- ☆10 · Oct 27, 2020 · Updated 5 years ago
- code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering ☆29 · May 30, 2025 · Updated 9 months ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving" ☆23 · Sep 1, 2025 · Updated 6 months ago
- ☆18 · Sep 19, 2025 · Updated 6 months ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023) ☆55 · Sep 7, 2023 · Updated 2 years ago
- the public repo for stats205 scribe notes at Stanford University ☆14 · Jun 10, 2021 · Updated 4 years ago
- Code and models for MICCAI23 paper: "Self-Supervised Learning for Endoscopy Video Analysis". ☆22 · Oct 2, 2023 · Updated 2 years ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers" ☆21 · Jun 16, 2023 · Updated 2 years ago
- TabMap for high-performance tabular data analysis - Nature BME ☆19 · Jan 8, 2025 · Updated last year
- Failures in machine learning for medical imaging ☆32 · Feb 15, 2022 · Updated 4 years ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs" ☆92 · Mar 9, 2025 · Updated last year
- Some time series vectorization methods which could give better representation for classification / clustering or other analysis. ☆11 · Jan 4, 2016 · Updated 10 years ago
- Ready made firmware file for TPLink TL-MR3020 router for using extroot ☆12 · Nov 8, 2015 · Updated 10 years ago
- ☆30 · Apr 27, 2012 · Updated 13 years ago
- AAAI2023 Reducing Domain Gap in Frequency and Spatial domain for Cross-modality Domain Adaptation on Medical Image Segmentation ☆28 · Sep 19, 2023 · Updated 2 years ago
- Ampache Player for Android (moved) ☆10 · Oct 8, 2013 · Updated 12 years ago
- Docker for running stroke lesion core segmentation ☆30 · Dec 15, 2020 · Updated 5 years ago
- The more often you click a word in the headlines, the more interesting are your news. ☆13 · Mar 27, 2017 · Updated 8 years ago
- 2nd place solution for the RSNA STR Pulmonary Embolism Detection competition on Kaggle. ☆30 · Nov 29, 2020 · Updated 5 years ago
- Evolution-ary Reinforcement Learning ☆12 · Apr 16, 2017 · Updated 8 years ago
- Jinc (EWA Lanczos) Resampler Plugin for Avisynth/Avisynth+ ☆21 · Jul 29, 2014 · Updated 11 years ago
- SugarCRM integration with Live Helper Chat ☆10 · Jun 21, 2021 · Updated 4 years ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020) ☆112 · Oct 15, 2021 · Updated 4 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023) ☆86 · Apr 24, 2023 · Updated 2 years ago
- 💫 Static page generator with routes support thats infinitely awesome ☆11 · Feb 7, 2018 · Updated 8 years ago
- Search-Category-And-Info-Detail API ☆12 · Mar 7, 2023 · Updated 3 years ago
- The complete Buddycloud stack in a VM ☆23 · Mar 5, 2016 · Updated 10 years ago
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models" ☆58 · Aug 15, 2025 · Updated 7 months ago
- A free Twilio app to let Boston residents call their families while phone coverage is poor. ☆90 · Mar 12, 2016 · Updated 10 years ago
- [ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling ☆150 · Aug 22, 2025 · Updated 7 months ago
- Awesome radiology report generation and image captioning papers. ☆79 · Oct 15, 2024 · Updated last year