lookwei / COMP4423Links
Course materials for COMP 4423 - Computer Vision for Beginners at the Hong Kong Polytechnic University
☆28Updated last year
Alternatives and similar repositories for COMP4423
Users that are interested in COMP4423 are comparing it to the libraries listed below
Sorting:
- Multimodal Empathetic Chatbot☆43Updated last year
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆17Updated last year
- ☆43Updated 9 months ago
- ☆59Updated 2 months ago
- GPT-4V with Emotion☆94Updated last year
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆64Updated 6 months ago
- GPUSnatcher is a tool for GPU resource monitoring and snatching, designed to help users temporarily monitor and grab idle GPU resources.☆78Updated this week
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆30Updated 6 months ago
- ICLR2024 statistics☆48Updated last year
- My personal homepage☆100Updated this week
- Awsome works based on SSM and Mamba☆17Updated last year
- ☆42Updated 3 months ago
- Idempotent Generative Network's unofficial pytorch implementation☆45Updated last year
- Facial Action Unit Detection Model and Visualization Canvas☆26Updated 2 weeks ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆36Updated 6 months ago
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆26Updated last year
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆83Updated 7 months ago
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆87Updated 7 months ago
- A comprehensive overview of affective computing research in the era of large language models (LLMs).☆25Updated last year
- ☆258Updated last year
- diffusion generative model☆194Updated 3 years ago
- Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.☆276Updated last year
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆19Updated 9 months ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…☆340Updated 5 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆28Updated this week
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i…☆64Updated last year
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition☆129Updated 9 months ago
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalitiesModels☆56Updated this week
- ☆15Updated 2 years ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆131Updated last year