rangarodrigo / EN1060Lectures
EN1060 lectures
☆11 Updated 3 years ago
Related projects
Alternatives and complementary repositories for EN1060Lectures
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla… ☆45 Updated last year
- Official code repository of paper titled "Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Visio… ☆19 Updated 2 months ago
- Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs". ☆40 Updated 2 months ago
- [InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Cl… ☆30 Updated 2 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆25 Updated 2 months ago
- Official implementation of the paper "STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models" ☆15 Updated 2 months ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners". ☆248 Updated 7 months ago
- ☆13 Updated 8 months ago
- [CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models". ☆90 Updated 2 months ago
- A curated list of awesome self-supervised learning methods in videos ☆112 Updated this week
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval ☆22 Updated 2 months ago
- A Large Multimodal Model for Pixel-Level Visual Grounding in Videos ☆18 Updated this week
- ☆13 Updated 4 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization ☆95 Updated 9 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆88 Updated 6 months ago
- [ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀 ☆31 Updated last month
- Official Implementation of "The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval" ☆46 Updated last week
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023) ☆84 Updated 10 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts" ☆68 Updated 5 months ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023) ☆73 Updated 3 months ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr… ☆116 Updated 2 months ago
- ☆21 Updated last month
- Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos" ☆80 Updated 3 months ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper. ☆211 Updated 2 weeks ago
- ☆53 Updated 3 months ago
- [EMNLP 2024] Official code repository of paper titled "PALM: Few-Shot Prompt Learning for Audio Language Models" accepted in EMNLP 2024 c… ☆19 Updated last week
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi… ☆75 Updated last month
- [MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accep… ☆53 Updated 3 weeks ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning ☆37 Updated 2 months ago
- ☆112 Updated 3 months ago