xianzhangzx/FINER-MLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xianzhangzx/FINER-MLLM)

xianzhangzx / FINER-MLLM

The implementation of FINER-MLLM, which is accepted by MM2024.

☆18

Alternatives and similar repositories for FINER-MLLM

Users that are interested in FINER-MLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LutingWang / HEAD
View on GitHub
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10Jul 3, 2023Updated 3 years ago
iLearn-Lab / MM2023-FGKVMemPred_video
View on GitHub
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
☆23Jul 11, 2024Updated 2 years ago
xiaojieli0903 / CKPD-FSCIL
View on GitHub
[ACM MM 2026] Official implementation of “Continuous Knowledge-Preserving Decomposition with Adaptive Layer Selection for Few-Shot Class-…
☆34Jul 12, 2026Updated 2 weeks ago
iLearn-Lab / MM2023-MaskAgain
View on GitHub
Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)
☆27Jul 11, 2024Updated 2 years ago
dhg-wei / MCL
View on GitHub
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28Sep 27, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
iboing / CorDA
View on GitHub
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)
☆59Jan 13, 2025Updated last year
layumi / ICME2022SS
View on GitHub
ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”
☆12Jun 3, 2024Updated 2 years ago
csyizhou / Vehicle-Re-ID
View on GitHub
Viewpoint-aware Attentive Multi-view Inference for Vehicle Re-identification
☆13Mar 27, 2019Updated 7 years ago
vec-ai / wikiHow-TIIR
View on GitHub
[ACL 2025] Towards Text-Image Interleaved Retrieval
☆16Sep 3, 2025Updated 10 months ago
vl2g / CSTBIR
View on GitHub
Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
☆15Dec 27, 2023Updated 2 years ago
iLearn-Lab / ICML24-RoboMP2
View on GitHub
[ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…
☆12Apr 4, 2026Updated 3 months ago
icq-benchmark / icq-benchmark
View on GitHub
☆19Jul 28, 2025Updated last year
LgQu / DPT-T2I
View on GitHub
Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation
☆33Mar 30, 2025Updated last year
double125 / Graph-Matching-Attention
View on GitHub
Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering
☆11Feb 16, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ning-mz / SCA-GPS
View on GitHub
Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems
☆16Dec 27, 2023Updated 2 years ago
yangcong356 / KCFI
View on GitHub
This is the official code for "Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning"
☆25Apr 2, 2026Updated 3 months ago
gabrielchua / embedding-adapter
View on GitHub
A lightweight open-source package to fine-tune embedding models.
☆22Feb 4, 2024Updated 2 years ago
Muccul / AddRain-CycleGAN
View on GitHub
Add Rain Streak Mask On Unparied Image Using GAN
☆10Sep 12, 2020Updated 5 years ago
layumi / ACMMM2023Workshop
View on GitHub
UAVM @ ACM MM2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective
☆17Apr 30, 2025Updated last year
ZJU-DAILY / Metric_Index
View on GitHub
This repository contains the code of metric indexing for exact similarity search.
☆12Jul 11, 2023Updated 3 years ago
DeployQL / awesome-multi-vector
View on GitHub
A list of multi-vector retrieval resources
☆19May 29, 2024Updated 2 years ago
scloudyy / Defogging
View on GitHub
A python package of robust and effective defogging/dehazing method
☆15Dec 30, 2018Updated 7 years ago
CVer-Yang / HCNet
View on GitHub
☆14Sep 28, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jingchenchen / ReasoningConsistency-VQA
View on GitHub
☆13Aug 14, 2022Updated 3 years ago
HITsz-TMG / SKURG
View on GitHub
☆20Nov 4, 2023Updated 2 years ago
OpenMatch / UniVL-DR
View on GitHub
[ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…
☆52Jul 3, 2024Updated 2 years ago
iLearn-Lab / SIGIR24-FTI4CIR
View on GitHub
Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
☆27Apr 9, 2026Updated 3 months ago
miaoyuchun / InfoRM
View on GitHub
The official implementation of InfoRM [NeurIPS 2024].
☆16Oct 25, 2025Updated 9 months ago
jlparkI / mix_T
View on GitHub
Python (pip) package for fitting mixtures of Student's t-distributions using either maximum likelihood (EM) or Bayesian methodology (vari…
☆11Sep 23, 2025Updated 10 months ago
krystalan / AwesomeSEG
View on GitHub
A curated list of Story Ending Generation models; DASFAA'22: Incorporating Commonsense Knowledge into Story Ending Generation via Heterog…
☆15May 12, 2022Updated 4 years ago
ADLab-AutoDrive / ICKD
View on GitHub
Offical Code for Paper "Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation"
☆17Jan 19, 2022Updated 4 years ago
pransen / ComputerVisionAlgorithms
View on GitHub
☆17Sep 27, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Tomchenshi / CYformer
View on GitHub
The pytorch code of hyperspectral and multispectral image fusion method cyformer.
☆15Aug 25, 2025Updated 11 months ago
ekzhu / josie
View on GitHub
Code and Benchmarks for JOSIE (SIGMOD 2019)
☆20Apr 13, 2023Updated 3 years ago
AggMan96 / RK-Net
View on GitHub
Code for RK-Net
☆32Mar 26, 2023Updated 3 years ago
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
chenyuntc / keypoint
View on GitHub
Implemention of "Realtime Multi Person Pose-Estimation" in pytorch with data from AI Challenger
☆13Nov 24, 2017Updated 8 years ago
Huntersxsx / MGPN
View on GitHub
source code of our MGPN in SIGIR 2022
☆18Jun 8, 2022Updated 4 years ago
ZJU-DAILY / PSAMS
View on GitHub
Source code for Pivot Selection Algorithms in Metric Spaces: An Experimental Evaluation. VLDBJ 2021.
☆15Jul 27, 2021Updated 5 years ago