EmmaSRH/ARVFM

Awesome autoregressive vision foundation models

☆26

Alternatives and similar repositories for ARVFM

Users interested in ARVFM are comparing it to the repositories listed below.

longrongyang / STGC
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13 · Feb 11, 2025 · Updated last year
shxie2020 / Awesome-UGVFM
A collection of vision foundation models unifying understanding and generation.
☆60 · Jan 2, 2025 · Updated last year
T-Lab-CUHKSZ / G2RPO-A
[ACL 2026] G2RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
☆16 · May 20, 2026 · Updated 2 months ago
junwan2014 / MMDN-master
A PyTorch implementation of "Robust Facial Landmark Detection by Multi-order Multi-constrained Network"
☆13 · Dec 9, 2020 · Updated 5 years ago
LINs-lab / ReLA
[NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations
☆19 · Jan 19, 2025 · Updated last year
pipilurj / perceptionGPT
☆18 · Aug 7, 2024 · Updated last year
LINs-lab / awesome_papers
☆20 · May 28, 2025 · Updated last year
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Updated this week
MembrLab / MIU-VL
Repository for the ICLR 2023 paper "Medical Image Understanding With Pretrained VLM"
☆31 · Jun 9, 2023 · Updated 3 years ago
Princeton-AI2-Lab / ZoomClick
A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method.
☆25 · Dec 8, 2025 · Updated 7 months ago
putshua / SNN-RAT
☆12 · Oct 4, 2022 · Updated 3 years ago
liangling76 / snn_attack
☆13 · Nov 30, 2021 · Updated 4 years ago
mchen725 / DD_IGD
[ICLR 2025] Official repository for the paper "Influence-Guided Diffusion for Dataset Distillation".
☆15 · Feb 12, 2025 · Updated last year
apchenstu / SLN-Amodal
[ACM MM2019] Learning Semantics-aware Distance Map with Semantics Layering Network for Amodal Instance Segmentation
☆32 · Sep 3, 2020 · Updated 5 years ago
ali-vilab / CDT
Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
☆17 · Apr 2, 2025 · Updated last year
LINs-lab / FedBR
[ICML 2023] FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction
☆29 · Mar 7, 2024 · Updated 2 years ago
HelmholtzAI-FZJ / flex_gen
☆20 · Jan 10, 2025 · Updated last year
LINs-lab / GMem
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆43 · Mar 11, 2025 · Updated last year
yiren-jian / BLIText
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
☆26 · Dec 5, 2023 · Updated 2 years ago
LINs-lab / RCGM
[ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation
☆39 · Feb 4, 2026 · Updated 5 months ago
zjwzcn07 / Statistical-Learning
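Implementations of the algorithms from each chapter of Li Hang's "Statistical Learning Methods" (统计学习方法), with no dependence on other libraries
☆11 · Feb 20, 2017 · Updated 9 years ago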
BigNeuron / Code_and_Analysis_Tools
☆12 · Mar 25, 2022 · Updated 4 years ago
HGao-cv / VADv2
Project page of "VAD v2: LLM-Like Probabilistic Modeling in End-to-End Autonomous Driving"
☆11 · Apr 9, 2026 · Updated 3 months ago
tliby / UniFork
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
☆48 · Aug 26, 2025 · Updated 10 months ago
Fsoft-AIC / Z-GMOT
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
☆12 · May 19, 2026 · Updated 2 months ago
Liuziyu77 / MMDU
Official repository of MMDU dataset
☆108 · Sep 29, 2024 · Updated last year
Hon-Wong / ByteVideoLLM
[ICCV 2025] Dynamic-VLM
☆28 · Dec 16, 2024 · Updated last year
G-U-N / consolver
[CVPR 2026 (Highlight)] Unofficial Implementation of "Image Diffusion Preview with Consistency Solver"
☆30 · Jan 24, 2026 · Updated 6 months ago
SZUHvern / MaCo
The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…
☆12 · Sep 13, 2024 · Updated last year
illume-unified-mllm / ILLUME_plus
[CVPR2025] Official Implementation of ILLUME+
☆126 · Aug 20, 2025 · Updated 11 months ago
yxchng / mask-grounding
[CVPR2024] Mask Grounding for Referring Image Segmentation
☆29 · Jul 22, 2024 · Updated 2 years ago
bocklab / temca2data
Public code for Zheng, Lauritzen et al. (2018)
☆15 · Mar 22, 2019 · Updated 7 years ago
LingDong- / lingcordion
Pocket-sized digital musical instrument inspired by the piano and the accordion
☆13 · Oct 9, 2024 · Updated last year
lucasdegeorge / T2I-ImageNet
Code for "How far can we go with ImageNet for Text-to-Image generation?" paper
☆97 · May 27, 2026 · Updated last month
i207M / Pomodoro-Improved-Strict-Workflow
Practice the Pomodoro Technique: block time-wasting websites while working and allow access during breaks. A Chrome/Edge extension that helps you stay focused by blocking sites during work timers and letting you bro…
☆13 · Jul 26, 2022 · Updated 3 years ago
EdenGabriel / TaskWeave
[CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
☆30 · Sep 26, 2024 · Updated last year
bronyayang / Law_of_Vision_Representation_in_MLLMs
[COLM'25] Official implementation of the Law of Vision Representation in MLLMs
☆177 · Oct 6, 2025 · Updated 9 months ago
shaoshitong / G_VBSM_Dataset_Condensation
[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)
☆27 · Oct 9, 2024 · Updated last year
JoshuaChou2018 / MedAGI
Path to Medical AGI: Unify Domain-specific Medical LLMs with the Lowest Cost
☆39 · Jun 21, 2023 · Updated 3 years ago