lookwei / COMP4423
Course materials for COMP 4423 - Computer Vision for Beginners at the Hong Kong Polytechnic University
☆28Updated last year
Alternatives and similar repositories for COMP4423:
Users who are interested in COMP4423 are comparing it to the repositories listed below
- ☆111Updated 3 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆68Updated 2 months ago
- [IJCAI 2024] Continual Multimodal Knowledge Graph Construction☆49Updated 5 months ago
- Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model☆24Updated 8 months ago
- Machine Learning Playground: mainly covers machine learning fundamentals, deep learning practice, and industrial applications.☆15Updated 2 years ago
- Deformable Graph Convolutional Networks (Author's PyTorch implementation for the AAAI 2022 paper)☆28Updated 2 years ago
- Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is…☆38Updated 2 years ago
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆17Updated 4 months ago
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆28Updated 2 months ago
- ☆43Updated this week
- ☆39Updated this week
- Personal PolyU COMP UG Subject Archive☆15Updated 3 months ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆44Updated last month
- Explanation of the llama2 repo.☆10Updated 9 months ago
- A collection of omni-mllm☆25Updated last week
- ☆18Updated 5 months ago
- Multimodal Empathetic Chatbot☆37Updated 9 months ago
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆63Updated 2 months ago
- This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju☆62Updated 11 months ago
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆18Updated last year
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆20Updated 2 months ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆72Updated last year
- [ACL2023] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings☆19Updated last year
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆47Updated 4 months ago
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆14Updated 10 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆35Updated 2 months ago
- [ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning☆65Updated 10 months ago
- Narrative movie understanding benchmark☆70Updated 11 months ago
- This repository contains the code for AdaCLIP, a computation and latency-aware system for pragmatic multimodal video retrieval.☆10Updated 11 months ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆15Updated 2 months ago