g-luo/vlm_cross_modal_reps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/g-luo/vlm_cross_modal_reps)

g-luo / vlm_cross_modal_reps

Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025

☆34

Alternatives and similar repositories for vlm_cross_modal_reps

Users that are interested in vlm_cross_modal_reps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
jsikyoon / OCRL
View on GitHub
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12Feb 23, 2024Updated 2 years ago
elad-amrani / xtra
View on GitHub
PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025
☆14Nov 21, 2025Updated 8 months ago
OpenGVLab / TPO
View on GitHub
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
☆65Jul 22, 2025Updated last year
SLIT-AI / WRPO
View on GitHub
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14Mar 17, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AIRI-Institute / LLM-Microscope
View on GitHub
☆62Mar 3, 2025Updated last year
BatsResearch / cross-lingual-detox
View on GitHub
Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024
☆18Mar 25, 2025Updated last year
Brandon3964 / MultiModal-Task-Vector
View on GitHub
[NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"
☆27Apr 8, 2025Updated last year
FatemehShiri / Spatial-MM
View on GitHub
☆12Jan 10, 2025Updated last year
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
TIGER-AI-Lab / ABC
View on GitHub
ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]
☆20Aug 21, 2025Updated 11 months ago
SAP-archive / acl2020-commonsense
View on GitHub
Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020.
☆29Aug 2, 2024Updated last year
Dreamyao516 / DialogueLLM
View on GitHub
☆10Jan 18, 2024Updated 2 years ago
luo-junyu / RobustFT
View on GitHub
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
☆44Dec 20, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
namhoonlee / spp-public
View on GitHub
A Signal Propagation Perspective for Pruning Neural Networks at Initialization
☆14Jun 23, 2020Updated 6 years ago
facebookresearch / ViP-MAE
View on GitHub
This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision
☆37Jun 27, 2023Updated 3 years ago
tegg89 / Deep-blogs
View on GitHub
A curated lists of self-taught materials including research blogs
☆16Dec 12, 2016Updated 9 years ago
shimisalant / CWR
View on GitHub
Author implementation of "Contextualized Word Representations for Reading Comprehension" (Salant et al. 2017)
☆11Jun 14, 2018Updated 8 years ago
Sueqk / LMM-VQA
View on GitHub
LMM for VQA, tcsvt version
☆10Jul 19, 2024Updated 2 years ago
g-luo / geolocation_via_guidebook_grounding
View on GitHub
G^3: Geolocation via Guidebook Grounding, Findings of EMNLP 2022
☆17Sep 10, 2024Updated last year
mishajw / repeng
View on GitHub
Experiments with representation engineering
☆14Feb 28, 2024Updated 2 years ago
ByungKwanLee / Phantom
View on GitHub
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆63Oct 9, 2024Updated last year
itsmag11 / Omegance
View on GitHub
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)
☆52Jan 14, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tamangmilan / llama3
View on GitHub
Building Llama 3 from scratch using PyTorch
☆13Sep 1, 2024Updated last year
dido1998 / CTRL-O
View on GitHub
☆23Jun 17, 2025Updated last year
BoAi01 / embodiment-scaling-laws
View on GitHub
Implementation of the paper on Embodiment Scaling Laws in Robot Locomotion (CoRL 2025)
☆27Jul 21, 2026Updated last week
WangRongsheng / LLM101
View on GitHub
This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…
☆24May 5, 2025Updated last year
Parker-rfu / SeLaReasoning
View on GitHub
[ACL 2026 oral] SeLaR: Selective Latent Reasoning in Large Language Models
☆21Apr 25, 2026Updated 3 months ago
cvenhoff / steering-thinking-llms
View on GitHub
☆39Jul 9, 2025Updated last year
lasgroup / user_interactions
View on GitHub
Aligning Language Models from User Interactions via Self-Distillation
☆26Mar 31, 2026Updated 3 months ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated 2 years ago
qingze-bai / XctDiff
View on GitHub
☆17Nov 20, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
quicksviewer / quicksviewer
View on GitHub
☆19Jun 29, 2025Updated last year
alhojel / visual_task_vectors
View on GitHub
☆41Jul 19, 2024Updated 2 years ago
claudia-viaro / Wdss-UCLdss_research
View on GitHub
☆12Aug 31, 2022Updated 3 years ago
mbzuai-nlp / Fakenews-dataset
View on GitHub
☆17Apr 7, 2024Updated 2 years ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
recursal / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆46Jul 20, 2024Updated 2 years ago