The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders
☆42Jun 10, 2025Updated 10 months ago
Alternatives and similar repositories for LIFT
Users that are interested in LIFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 2, 2025Updated last year
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023)☆18Dec 8, 2024Updated last year
- official code for unigame☆19Nov 26, 2025Updated 4 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated last year
- ☆22Jul 23, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark☆299Nov 5, 2025Updated 5 months ago
- ☆35Feb 15, 2026Updated last month
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 5 months ago
- ☆15Jan 12, 2026Updated 2 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- An open source implementation of CLIP (With TULIP Support)☆164May 14, 2025Updated 10 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆133May 16, 2025Updated 10 months ago
- [JAG 2026] DreamCD: A change-label-free framework for change detection via a weakly conditional semantic diffusion model in optical VHR i…☆24Jan 30, 2026Updated 2 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)☆18Jul 20, 2025Updated 8 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆59Jan 26, 2026Updated 2 months ago
- Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…☆38Sep 28, 2025Updated 6 months ago
- Siggraph 2025 Journal track☆25Aug 13, 2025Updated 7 months ago
- Control LLM☆22Apr 6, 2025Updated last year
- ☆24May 23, 2025Updated 10 months ago
- ☆33Apr 22, 2025Updated 11 months ago
- Official code repo of SimMLM [ICCV 2025]☆24Dec 1, 2025Updated 4 months ago
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur…☆15Jul 7, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last month
- ☆31Sep 19, 2025Updated 6 months ago
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆105Jan 27, 2026Updated 2 months ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Nov 19, 2025Updated 4 months ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework☆18Sep 8, 2025Updated 7 months ago
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing☆78Mar 7, 2026Updated last month
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆27Nov 7, 2025Updated 5 months ago
- ☆52Jan 13, 2026Updated 2 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆105Mar 10, 2026Updated last month
- [ICLR 2025 Spotlight] Official Implementation for ToST (Token Statistics Transformer)☆133Feb 25, 2025Updated last year
- [NeurIPS 2025🔥:] EVODiff is an inference-time refinement method for diffusion models that improves sampling efficiency and generative f…☆31Feb 2, 2026Updated 2 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- PIPClass是一个基于自载ViT(Vision Transformer)模型的comfyui使用框架,专为图像分类设计。它不仅可以快速识别图像中的内容,还能提供详细的分类得分,帮助用户更好地理解模型的判断依据。PIP_ClassV1 模型是基于ViT-B/16预训练模型…☆11Mar 14, 2025Updated last year
- [ICCV 2025] CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers☆16Mar 3, 2026Updated last month
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆47Jun 3, 2024Updated last year