MIV-XJTU/FLAME

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MIV-XJTU/FLAME)

MIV-XJTU / FLAME

[CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"

☆33

Alternatives and similar repositories for FLAME

Users that are interested in FLAME are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
Should-AI-Lab / GRID
View on GitHub
The official implementation of 'GRID: Visual Layout Generation.'
☆21Dec 28, 2024Updated last year
XYPB / CLEFT
View on GitHub
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18Feb 12, 2025Updated last year
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆22Oct 8, 2024Updated last year
zhaozeyang108 / Oriented-DETR
View on GitHub
The Project of ECCV 2024 Oral Paper "Oriented Object Detection vis Point-Axis Representation"
☆79Dec 12, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated last year
MIV-XJTU / SPEED
View on GitHub
PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.
☆20Jun 28, 2024Updated 2 years ago
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
icon-lab / MedTrim
View on GitHub
Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"
☆15Mar 27, 2026Updated 3 months ago
VincentDENGP / 3D-LR
View on GitHub
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20Mar 28, 2024Updated 2 years ago
rabiulcste / vismin
View on GitHub
[NeurIPS24] VisMin: Visual Minimal-Change Understanding
☆19Mar 3, 2025Updated last year
RAIVNLab / sugar-crepe
View on GitHub
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆93Feb 13, 2024Updated 2 years ago
guanjinquan / CXRTrek
View on GitHub
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight
☆13May 26, 2025Updated last year
vinid / neg_clip
View on GitHub
NegCLIP.
☆41Feb 6, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
rajpurkarlab / ReXKG
View on GitHub
☆17Sep 23, 2024Updated last year
MIV-XJTU / EvoPrompt
View on GitHub
PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).
☆13Apr 15, 2024Updated 2 years ago
AdamRain / YFCC15M_downloader
View on GitHub
A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).
☆19Nov 13, 2024Updated last year
WeixiongLin / Build-PMC-OA
View on GitHub
The official code to build up dataset PMC-OA
☆34Jul 16, 2024Updated 2 years ago
OrigamiSL / OTETrack
View on GitHub
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11Sep 3, 2024Updated last year
ChenXiaoFei-CS / KoBo
View on GitHub
Official implementation of MICCAI2023【Knowledge Boosting: Rethinking Medical Contrastive Vision-Langauge Pre-training】
☆16Mar 19, 2024Updated 2 years ago
tangyuhao2016 / CTRG
View on GitHub
☆19Aug 21, 2023Updated 2 years ago
ChocoWu / SeTok
View on GitHub
Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM
☆81Apr 19, 2025Updated last year
SiyuanYan1 / MAKE
View on GitHub
[MICCAI‘25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
☆23Feb 27, 2026Updated 4 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
mlii0117 / FFA-IR
View on GitHub
The official start-up code for paper "FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark."
☆67Jan 21, 2025Updated last year
Lichang-Chen / AlpaGasus
View on GitHub
A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)
☆24Jul 26, 2024Updated last year
tangzhengxu2001 / m4oe
View on GitHub
☆16Apr 3, 2025Updated last year
liubo105 / SAT
View on GitHub
Improving Medical Vision-Language Contrastive Pretraining with Semantics-aware Triage
☆11Jun 25, 2023Updated 3 years ago
LHL3341 / ContextBLIP
View on GitHub
ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]
☆11May 17, 2024Updated 2 years ago
MrRezaeiUofT / AMG-RAG
View on GitHub
AMG-RAG (Agentic Medical Graph-RAG) is a comprehensive framework that automates the construction and continuous updating of Medical Knowl…
☆37Feb 5, 2026Updated 5 months ago
yhygao / Explicd
View on GitHub
☆18Sep 19, 2024Updated last year
yangzhou12 / BenchX
View on GitHub
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
☆49Dec 27, 2025Updated 6 months ago
Yangyi-Chen / SOLO
View on GitHub
[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
☆150Nov 14, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lezhang7 / TreeMix
View on GitHub
[NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
☆10Jul 15, 2023Updated 3 years ago
mbzuai-oryx / Video-CoM
View on GitHub
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22Jun 17, 2026Updated last month
locuslab / T-MARS
View on GitHub
Code for T-MARS data filtering
☆35Aug 23, 2023Updated 2 years ago
MAGIC-AI4Med / MAP
View on GitHub
☆16Jul 2, 2026Updated 2 weeks ago
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆28Mar 15, 2026Updated 4 months ago
lezhang7 / Enhance-FineGrained
View on GitHub
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
☆56Apr 7, 2025Updated last year
MrGiovanni / RT-Super
View on GitHub
[MICCAI 2026] A longitudinal, multimodal algorithm for multi-tumor segmentation (learning from reports).
☆15Jun 29, 2026Updated 3 weeks ago