levymsn / CQA-CRCT
Official PyTorch implementation for "Classification-Regression for Chart Comprehension"
☆26 Updated 2 months ago
Alternatives and similar repositories for CQA-CRCT:
Users who are interested in CQA-CRCT are comparing it to the libraries listed below:
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024). ☆79 Updated last week
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning ☆16 Updated 3 months ago
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023] ☆597 Updated 10 months ago
- Official PyTorch Implementation for the "Model Tree Heritage Recovery" paper. ☆57 Updated 9 months ago
- Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper. ☆139 Updated 2 years ago
- CLIPScore EMNLP code ☆221 Updated 2 years ago
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper. ☆15 Updated last month
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper. ☆33 Updated 9 months ago
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022] ☆575 Updated 10 months ago
- [NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation ☆250 Updated 2 weeks ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings) ☆195 Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance ☆257 Updated last year
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion ☆172 Updated 11 months ago
- Official PyTorch Implementation for the "A Deep Inverse-Mapping Model for a Flapping Robotic Wing" Paper (ICLR 2025) ☆15 Updated last month
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆81 Updated 10 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering ☆160 Updated 11 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆314 Updated 10 months ago
- Official PyTorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models ☆198 Updated last year
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or… ☆121 Updated 9 months ago
- 🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models". ☆452 Updated last year
- Densely Captioned Images (DCI) dataset repository. ☆177 Updated 9 months ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023) ☆89 Updated last year
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆70 Updated 11 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation? ☆38 Updated 10 months ago
- 🚀 Cross attention map tools for huggingface/diffusers ☆262 Updated 3 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆125 Updated 10 months ago
- [CVPR 2023] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation ☆260 Updated last year