wangyuchi369/RICO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangyuchi369/RICO)

wangyuchi369 / RICO

Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

☆21

Alternatives and similar repositories for RICO

Users that are interested in RICO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangyuchi369 / LaDiC
View on GitHub
[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?
☆42Jun 9, 2024Updated 2 years ago
JianhongBai / BaCon
View on GitHub
Official implementation of "Towards Distribution-Agnostic Generalized Category Discovery" (NIPS 2023)
☆29Oct 21, 2023Updated 2 years ago
JianhongBai / UniEdit
View on GitHub
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
☆121Apr 16, 2025Updated last year
Ingrid789 / SkillMimic-V2
View on GitHub
[SIGGRAPH 2025] SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations
☆154Jul 24, 2025Updated 11 months ago
beeevita / Classical-Chinese-NER-RE-Dataset
View on GitHub
A dataset used for NLP tasks.
☆10Apr 17, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
llyx97 / video_reason_bench
View on GitHub
[ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…
☆41Jan 30, 2026Updated 5 months ago
beeevita / EvoPrompt
View on GitHub
Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
☆248Sep 22, 2025Updated 9 months ago
GaryJiajia / TSG
View on GitHub
[ACL 2023] Transforming Visual Scene Graphs to Image Captions
☆10Dec 13, 2023Updated 2 years ago
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
zechao-li / SVF-few-shot-segmentation
View on GitHub
☆22May 16, 2023Updated 3 years ago
GXYM / VCapsBench
View on GitHub
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
☆20Jun 2, 2025Updated last year
AIM3-RUC / Youmakeup_Challenge2022
View on GitHub
☆17Jun 15, 2022Updated 4 years ago
uf-robopi / UStyle
View on GitHub
Waterbody style transfer of underwater imagery (JOE 2025)
☆26Dec 12, 2025Updated 7 months ago
AIM3-RUC / Youmakeup_Baseline
View on GitHub
☆20Jul 27, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EvolvingLMMs-Lab / VideoMMMU
View on GitHub
Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
☆72Sep 5, 2025Updated 10 months ago
kdu4108 / context-vs-prior-finetuning
View on GitHub
☆15May 27, 2025Updated last year
HaozheZhao / MIC_tool
View on GitHub
☆14Nov 14, 2023Updated 2 years ago
Aman-4-Real / MMTG
View on GitHub
[ACM MM 2022] (Oral): Multi-Modal Experience Inspired AI Creation
☆21Nov 27, 2024Updated last year
ZiyiZhang27 / HRNeXt
View on GitHub
[IEEE TMM] Code for the paper "HRNeXt: High-Resolution Context Network for Crowd Pose Estimation"
☆10Feb 24, 2023Updated 3 years ago
calmiLovesAI / ComputerVision.pytorch
View on GitHub
计算机视觉
☆13Nov 13, 2023Updated 2 years ago
academicportfolio / academicportfolio.github.io
View on GitHub
Github Pages template for academic portfolio websites
☆17Oct 22, 2024Updated last year
kennethwdk / SAR
View on GitHub
Code for "Spatial-Aware Regression for Keypoint Localization", CVPR 2024 Highlight
☆19Jun 15, 2024Updated 2 years ago
KlingAIResearch / SynCamMaster
View on GitHub
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
☆691May 23, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
GongyeLiu / Awesome-Alignment-of-Diffusion-Models
View on GitHub
paper collection: alignment of diffusion models
☆29Mar 6, 2026Updated 4 months ago
AI-secure / FedGame
View on GitHub
Official implementation for paper "FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning" (NeurIPS 2023).
☆13Oct 25, 2024Updated last year
Ac-cool / AMAN
View on GitHub
☆12May 21, 2019Updated 7 years ago
lxtGH / Panoptic-PartFormer
View on GitHub
[ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation
☆63Sep 2, 2024Updated last year
MartyrPenink / SDPose
View on GitHub
Official implementation for 'SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation' on CVPR 2024
☆18May 15, 2024Updated 2 years ago
AIM3-RUC / YouMakeup
View on GitHub
☆29Apr 8, 2020Updated 6 years ago
THU-KEG / Crab
View on GitHub
[CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models
☆18May 23, 2025Updated last year
idanshen / Value-Augmented-Sampling
View on GitHub
☆20May 16, 2024Updated 2 years ago
haonanwang0522 / GTPT
View on GitHub
[ECCV 2024] GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
☆19Oct 5, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hyxcl / nsys_recipes
View on GitHub
these are custom recipes of nvidia nsight system post collection analysis.
☆16Nov 7, 2025Updated 8 months ago
WangWenhao0716 / TIP-I2V
View on GitHub
[ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
☆41Nov 27, 2024Updated last year
bimsarapathiraja / refedit
View on GitHub
[ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …
☆20Jun 27, 2025Updated last year
zhaosnw / evo_mem
View on GitHub
☆18Dec 21, 2025Updated 7 months ago
zht8506 / UniHead
View on GitHub
This is the repository for TNNLS paper: "Unihead: unifying multi-perception for detection heads"
☆15Jan 13, 2025Updated last year
L-Zhe / CoRPG
View on GitHub
Code for paper Document-Level Paraphrase Generation with Sentence Rewriting and Reordering by Zhe Lin, Yitao Cai and Xiaojun Wan. This pa…
☆26Nov 10, 2021Updated 4 years ago
KbsdJames / omni-math-rule
View on GitHub
The rule-based evaluation subset and code implementation of Omni-MATH
☆28Dec 23, 2024Updated last year