g-luo / task_vectors_are_cross_modal

Official PyTorch Implementation for Task Vectors are Cross-Modal

☆22

Alternatives and similar repositories for task_vectors_are_cross_modal:

Users that are interested in task_vectors_are_cross_modal are comparing it to the libraries listed below

RobertCsordas / moeut
☆78Updated 8 months ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆12Updated 9 months ago
Infini-AI-Lab / S2FT
☆17Updated 3 months ago
katiekang1998 / reasoning_generalization
☆31Updated 3 months ago
dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆26Updated last year
euclid-multimodal / Euclid
☆14Updated 3 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆29Updated last month
apple / ml-rpm-bench
☆41Updated 9 months ago
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆75Updated 4 months ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆54Updated 8 months ago
locuslab / llava-token-compression
☆41Updated 5 months ago
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24Updated 8 months ago
EvolvingLMMs-Lab / multimodal-sae
Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
☆127Updated 3 months ago
tianyi-lab / R2-T2
Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆15Updated last month
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆19Updated 4 months ago
shulin16 / MMInA
Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"
☆42Updated last month
AtsuMiyai / UPD
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
☆75Updated 7 months ago
wang-kee / LiNeS
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆25Updated 5 months ago
Gabesarch / ICAL
☆36Updated last month
facebookresearch / multimodal_rewardbench
Multimodal RewardBench
☆38Updated 2 months ago
John-AI-Lab / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆27Updated last week
orrzohar / Video-STaR
[ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
☆60Updated 9 months ago
pliang279 / HEMM
Holistic evaluation of multimodal foundation models
☆47Updated 8 months ago
chuanyang-Zheng / DAPE
The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆37Updated 6 months ago
zhixuan-lin / forgetting-transformer
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
☆95Updated 2 weeks ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34Updated 7 months ago
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆39Updated 6 months ago
Cranial-XIX / longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆50Updated 4 months ago
thunlp / DeepPerception
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
☆49Updated 3 weeks ago
beichenzbc / BoostStep
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆35Updated 3 months ago