HL-hanlin / Bifrost-1View external linksLinks
Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)
☆44Nov 24, 2025Updated 2 months ago
Alternatives and similar repositories for Bifrost-1
Users that are interested in Bifrost-1 are comparing it to the libraries listed below
Sorting:
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Feb 18, 2024Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]☆14Oct 22, 2024Updated last year
- Deep Generative Models, University of Tehran, Dr.Tavassolipour☆17Feb 5, 2024Updated 2 years ago
- Transactions on Multimedia (TMM25)☆19Apr 8, 2025Updated 10 months ago
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆47Dec 25, 2025Updated last month
- ☆22Jun 17, 2025Updated 7 months ago
- Official implementation of "Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models"☆34Nov 30, 2025Updated 2 months ago
- Official Implementation for Generative Neural Fields by Mixtures of Neural Implicit Functions☆19Mar 10, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆23Mar 13, 2025Updated 11 months ago
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…☆26Nov 19, 2024Updated last year
- MICCAI 2024: Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images☆26Apr 3, 2025Updated 10 months ago
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>☆31Jul 18, 2025Updated 6 months ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆28Feb 14, 2024Updated 2 years ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 5 months ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆195Dec 17, 2025Updated last month
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- ☆30May 9, 2024Updated last year
- Official code for the paper: Can3Tok (ICCV2025)☆39Aug 23, 2025Updated 5 months ago
- ☆17Sep 23, 2025Updated 4 months ago
- Tutorial on using Hugging Face's Vision Transformers for Image Classification☆10Sep 4, 2021Updated 4 years ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆234Jan 22, 2026Updated 3 weeks ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆223Updated this week
- ☆33Feb 29, 2024Updated last year
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆508Nov 14, 2025Updated 3 months ago
- UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture☆87Feb 5, 2026Updated last week
- Official implementation of USR (NeurIPS 2024)☆39Dec 21, 2024Updated last year
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Jun 14, 2025Updated 8 months ago
- Whether you're a beginner exploring LangChain or an advanced practitioner building scalable GenAI applications, this tutorial-style proje…☆12Updated this week
- Tutorial for Graph Neural Network at APBJC 2024.☆10Apr 21, 2025Updated 9 months ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆35Dec 9, 2025Updated 2 months ago
- [TIP2025] The implementation of "Uncertainty Guided Refinement for Fine-grained Salient Object Detection"☆15Apr 20, 2025Updated 9 months ago
- Pytorch implementation of Self-Refining Video Sampling☆144Feb 6, 2026Updated last week
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆47Jul 17, 2025Updated 6 months ago
- ☆43May 30, 2025Updated 8 months ago
- Pixel-Space Generative Models☆301May 11, 2025Updated 9 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆420Aug 26, 2025Updated 5 months ago
- YoloTeeth is a GitHub repository dedicated to leveraging YOLOv8 for precise instance segmentation and object detection in teeth X-ray ima…☆11Nov 10, 2024Updated last year
- ☆11Jan 18, 2025Updated last year