wangyuchi369/LaDiC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangyuchi369/LaDiC)

wangyuchi369 / LaDiC

[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?

☆42

Alternatives and similar repositories for LaDiC

Users that are interested in LaDiC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangyuchi369 / RICO
View on GitHub
Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…
☆21Dec 9, 2025Updated 7 months ago
JianhongBai / COLT
View on GitHub
Official implementation of "On the Effectiveness of Out-of-Distribution Data in Self-Supervised Long-Tail Learning" (ICLR 2023)
☆15Jul 15, 2023Updated 3 years ago
JianhongBai / BaCon
View on GitHub
Official implementation of "Towards Distribution-Agnostic Generalized Category Discovery" (NIPS 2023)
☆29Oct 21, 2023Updated 2 years ago
yuPeiyu98 / Diffusion-Amortized-MCMC
View on GitHub
[NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC
☆14Mar 1, 2026Updated 4 months ago
ZamDimon / distortion-generator
View on GitHub
Neural network for creating distortion while keeping embeddings as close as possible
☆20Feb 6, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
LinghaoChan / HumanTOMATO
View on GitHub
Web page for "🍅HumanTOMATO: Text-aligned Whole-body Motion Generation".
☆15May 25, 2024Updated 2 years ago
Ming-er / LGC-SED
View on GitHub
☆13Jan 3, 2024Updated 2 years ago
TobiasLee / VEC
View on GitHub
Visual and Embodied Concepts evaluation benchmark
☆21Oct 10, 2023Updated 2 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
KunhangL / finemotiondiffuse
View on GitHub
Motion Generation from Fine-grained Textual Descriptions (LREC-COLING 2024)
☆15Jun 13, 2024Updated 2 years ago
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated 2 years ago
beeevita / Classical-Chinese-NER-RE-Dataset
View on GitHub
A dataset used for NLP tasks.
☆10Apr 17, 2021Updated 5 years ago
Yifan-Song793 / GoodBadGreedy
View on GitHub
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆31Jul 17, 2024Updated 2 years ago
MonkeySeeDoCG / MoMo-code
View on GitHub
☆12Jul 27, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ziqipang / ADDP
View on GitHub
[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
☆15Jul 4, 2025Updated last year
cuongle1206 / OSDCap
View on GitHub
[NeurIPS24] Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos
☆23May 30, 2026Updated last month
HM4725 / The-Art-of-Multiprocessor-Programming
View on GitHub
THE ART of MULTIPROCESSOR PROGRAMMING, Maurice Herlihy & Nir Shavit
☆11Feb 12, 2023Updated 3 years ago
pkunlp-icler / PCA-EVAL
View on GitHub
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆107Mar 14, 2024Updated 2 years ago
zixiangzhou916 / UDE-2
View on GitHub
☆22Apr 17, 2024Updated 2 years ago
yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
jonflynng / qwen2-audio-finetune
View on GitHub
Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.
☆24Nov 23, 2024Updated last year
taewhankim / VIPCAP
View on GitHub
☆15Dec 31, 2024Updated last year
UCSB-AI / Discffusion
View on GitHub
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29Apr 27, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
YuqingWang1029 / TokenBridge
View on GitHub
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158Jul 24, 2025Updated last year
ljzycmd / SCD
View on GitHub
Consistent Human Image and Video Generation with Spatially Conditioned Diffusion
☆16Sep 1, 2025Updated 10 months ago
zixiangzhou916 / AvatarGPT
View on GitHub
☆50May 20, 2024Updated 2 years ago
boreng0817 / IFCap
View on GitHub
[EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
☆15May 13, 2025Updated last year
lizishi / DeTRC
View on GitHub
☆19Jun 23, 2026Updated last month
YoucanBaby / VTG-GPT
View on GitHub
[AAAI 2025] VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
☆112Jan 13, 2026Updated 6 months ago
furiosa-ai / uncage
View on GitHub
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
☆17Aug 12, 2025Updated 11 months ago
Ava4Everr / CodeHS-Java-APCSA
View on GitHub
Just a copy of https://github.com/RobynE23/CodeHS-Java-APCSA, but I added folders and some extra files that didn't exist. Another option …
☆27Jan 23, 2024Updated 2 years ago
MrNobodyCali / Con4m
View on GitHub
Official implementation of [NeurIPS 2024] Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification
☆15May 14, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FranxYao / Retrieval-Head-with-Flash-Attention
View on GitHub
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13Jun 15, 2024Updated 2 years ago
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
jinxiang-liu / UFE-AVS
View on GitHub
Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""
☆19Jul 7, 2024Updated 2 years ago
researchmm / AAST-pytorch
View on GitHub
[MM'20] Aesthetic-Aware Image Style Transfer
☆15Sep 16, 2021Updated 4 years ago
SiLangWHL / vMF-OP
View on GitHub
☆16Oct 2, 2022Updated 3 years ago
MediaBrain-SJTU / OC_LT
View on GitHub
Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024
☆19Jul 11, 2024Updated 2 years ago
MengLcool / SEGIC
View on GitHub
[ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".
☆27Oct 13, 2024Updated last year