lookwei / COMP4423Links
Course materials for COMP 4423 - Computer Vision for Beginners at the Hong Kong Polytechnic University
☆28Updated last year
Alternatives and similar repositories for COMP4423
Users that are interested in COMP4423 are comparing it to the libraries listed below
Sorting:
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆17Updated 6 months ago
- ☆111Updated 4 months ago
- ☆15Updated 11 months ago
- ☆12Updated last month
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆34Updated 3 months ago
- ☆23Updated last year
- [CVPR2024] ModaVerse: Efficiently Transforming Modalities with LLMs☆29Updated 11 months ago
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆14Updated 11 months ago
- My slides and examples for bachelor deep learning course☆12Updated 3 years ago
- An implacation of SignGraph: A Sign Sequence is Worth Graphs of Nodes (CVPR2024)☆24Updated 5 months ago
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalitiesModels☆33Updated last week
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆25Updated last year
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆15Updated 3 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆69Updated 4 months ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆68Updated last year
- ☆17Updated 7 months ago
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆28Updated last year
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆62Updated 3 months ago
- ☆54Updated last year
- This is an official PyTorch implementation of "Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximiza…☆24Updated last year
- Multimodal Empathetic Chatbot☆40Updated 10 months ago
- ☆36Updated 2 months ago
- [ACL 2025 Main] Multi-Agent System for Science of Science☆82Updated last week
- ☆39Updated 2 weeks ago
- ☆15Updated 2 years ago
- TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆42Updated 2 weeks ago
- 中文心理健康对话大模型 PsycoLLM☆39Updated 3 weeks ago
- LLMBind: A Unified Modality-Task Integration Framework☆18Updated 11 months ago
- 🔥 Omni large models and datasets for understanding and generating multi-modalities.☆15Updated 7 months ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆36Updated last year