HL-hanlin/Bifrost-1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HL-hanlin/Bifrost-1)

HL-hanlin / Bifrost-1

Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)

☆47

Alternatives and similar repositories for Bifrost-1

Users that are interested in Bifrost-1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mbzuai-oryx / EvoLMM
View on GitHub
Self Evolving Large Multimodal Models with Continuous Rewards
☆25Jun 9, 2026Updated last month
TuringEyeTest / TuringEyeTest
View on GitHub
Pixels, Patterns, but no Poetry: To See the World like Humans
☆18Aug 11, 2025Updated 11 months ago
naver-ai / lut
View on GitHub
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆14Dec 1, 2024Updated last year
HL-hanlin / V-Co
View on GitHub
Official implementation of V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising (ECCV 2026)
☆27Jun 29, 2026Updated last month
Pose-Group / MPT
View on GitHub
☆12Apr 26, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Yui010206 / MEXA
View on GitHub
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15Aug 22, 2025Updated 11 months ago
facebookresearch / metaquery
View on GitHub
Official Implementation of Paper Transfer between Modalities with MetaQueries
☆325Oct 12, 2025Updated 9 months ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
stepfun-ai / NextStep-1
View on GitHub
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆693Feb 27, 2026Updated 5 months ago
inclusionAI / GroveMoE
View on GitHub
☆24Aug 20, 2025Updated 11 months ago
jialuli-luka / SELMA
View on GitHub
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆35Mar 12, 2024Updated 2 years ago
Arhosseini77 / ADDNN_2023
View on GitHub
Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran
☆11Feb 18, 2024Updated 2 years ago
csuhan / Tar
View on GitHub
[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆202Sep 18, 2025Updated 10 months ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
leiluk1 / gaze-based-segmentation
View on GitHub
Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]
☆14Oct 22, 2024Updated last year
OpenIXCLab / CODA
View on GitHub
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
☆37Aug 28, 2025Updated 11 months ago
AMAP-ML / S2-Guidance
View on GitHub
[ICLR2026] Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
☆158May 14, 2026Updated 2 months ago
aditiii12 / SharpEuler
View on GitHub
Official implementation of Sharpness-Aware Flow Matching
☆17Updated this week
AniAggarwal / ecad
View on GitHub
[ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
☆30Mar 1, 2026Updated 4 months ago
jialuli-luka / Video-MSG
View on GitHub
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28Apr 14, 2025Updated last year
mansicer / self-verification
View on GitHub
☆18Dec 23, 2025Updated 7 months ago
Arhosseini77 / dgm_course_2023
View on GitHub
Deep Generative Models, University of Tehran, Dr.Tavassolipour
☆17Feb 5, 2024Updated 2 years ago
HaroldChen19 / VistaDPO
View on GitHub
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆42Jun 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
msu-video-group / NTIRE26_Saliency_Prediction
View on GitHub
CVPR-NTIRE 2026 Challenge on Video Saliency Prediction
☆17Mar 20, 2026Updated 4 months ago
google-research-datasets / uicrit
View on GitHub
UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…
☆27Nov 19, 2024Updated last year
TaatiTeam / Token-Perturbation-Guidance
View on GitHub
Official implementation of "Token Perturbation Guidance for Diffusion Models" [NeurIPS 2025]
☆17May 19, 2026Updated 2 months ago
manoja328 / TallyQA_dataset
View on GitHub
TallyQA: Answering Complex Counting Questions dataset
☆31Feb 19, 2024Updated 2 years ago
casiatao / LPO
View on GitHub
The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.
☆19May 22, 2025Updated last year
JiuhaiChen / BLIP3o
View on GitHub
Official implementation of BLIP3o-Series
☆1,663Nov 29, 2025Updated 8 months ago
butkej / MIL4Cyto
View on GitHub
Released code for the paper 'End-to-end Multiple Instance Learning for Whole-Slide Cytopathology of Urothelial Carcinoma'
☆10Nov 24, 2021Updated 4 years ago
Ziyang412 / Video-RTS
View on GitHub
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24Feb 18, 2026Updated 5 months ago
OpenGVLab / SDLM
View on GitHub
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…
☆98Dec 27, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NJU-PCALab / UltraHR-100k
View on GitHub
This is the official repository of UltraHR-100K.
☆45Nov 21, 2025Updated 8 months ago
Yui010206 / VEGGIE-VidEdit
View on GitHub
[ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
☆34Aug 18, 2025Updated 11 months ago
AIGeeksGroup / UniVid
View on GitHub
UniVid: The Open-Source Unified Video Model
☆32Oct 13, 2025Updated 9 months ago
Hungryyan1 / UniCorn
View on GitHub
☆80Apr 12, 2026Updated 3 months ago
liuting20 / SwimVG
View on GitHub
Transactions on Multimedia (TMM25)
☆21Apr 8, 2025Updated last year
TencentARC / TokLIP
View on GitHub
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
☆236Aug 18, 2025Updated 11 months ago
EIT-NLP / Layer_Select_Fuse_for_MLLM
View on GitHub
[CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆48Oct 29, 2025Updated 9 months ago