Qichuzyy / POALinks

Official implementation of ECCV24 paper: POA

☆24

Alternatives and similar repositories for POA

Users that are interested in POA are comparing it to the libraries listed below

Sorting:

shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆46Updated 5 months ago
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆29Updated 3 months ago
tianyi-lab / C3PO
Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆16Updated 3 months ago
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆25Updated last week
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆34Updated 4 months ago
ByungKwanLee / Phantom
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆60Updated 9 months ago
LCM-Lab / LCM_Stack
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆14Updated 5 months ago
SHI-Labs / OLA-VLM
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024
☆60Updated 5 months ago
eric-ai-lab / ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆36Updated 11 months ago
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆76Updated 7 months ago
Infini-AI-Lab / S2FT
☆19Updated 7 months ago
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆27Updated 4 months ago
THU-KEG / LongWriter-V
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆19Updated 4 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆41Updated last month
DCDmllm / HyperLLaVA
Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
☆28Updated last year
linkedin / ControlLLM
Control LLM
☆19Updated 3 months ago
AnonymousAlethiometer / SGD_SaI
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆52Updated 6 months ago
TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels"
☆14Updated 5 months ago
mbzuai-oryx / PALO
(WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…
☆84Updated 5 months ago
jiwonsong-dev / ReasoningPathCompression
Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"
☆21Updated 2 months ago
philippe-eecs / small-vision
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
☆34Updated last year
philippe-eecs / vitok
☆34Updated 2 months ago
Adamdad / neumeta
NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…
☆43Updated 8 months ago
yale-nlp / refdpo
☆16Updated last year
WangFei-2019 / SNARE
Project for SNARE benchmark
☆11Updated last year
RWKV / RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…
☆51Updated 4 months ago
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆18Updated last year
MLLM-Data-Contamination / MM-Detect
This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"
☆16Updated 3 weeks ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆46Updated last year