calisolo/Levels_image_captioning_NICE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/calisolo/Levels_image_captioning_NICE)

calisolo / Levels_image_captioning_NICE

NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU

☆11

Alternatives and similar repositories for Levels_image_captioning_NICE

Users that are interested in Levels_image_captioning_NICE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

USTC-IMCC / PaperReading
View on GitHub
Paper Reading of IMCC groups.
☆18Oct 22, 2025Updated 9 months ago
leekchan / phpy
View on GitHub
phPy is a simple way to call legacy PHP functions from Python.
☆28Aug 27, 2015Updated 10 years ago
RAIVNLab / CREPE
View on GitHub
[CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?
☆35Apr 27, 2023Updated 3 years ago
franciszzj / OpenPSG
View on GitHub
[ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
☆51Jan 8, 2025Updated last year
3DHCG / JDHR
View on GitHub
☆15Nov 11, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
icepeng / sts-logs
View on GitHub
Slay the Spire stat analyzer
☆10Mar 17, 2018Updated 8 years ago
canyonfrs / kingmojang
View on GitHub
킹모짱 모노레포
☆10Oct 24, 2023Updated 2 years ago
lezhang7 / Enhance-FineGrained
View on GitHub
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
☆56Apr 7, 2025Updated last year
SivanDoveh / TSVLC
View on GitHub
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆47Sep 25, 2023Updated 2 years ago
assembly-101 / assembly101-mistake-detection
View on GitHub
Annotations for the Mistake Detection benchmark of Assembly101
☆12Aug 3, 2023Updated 2 years ago
franciszchen / SCA-Net
View on GitHub
☆10Oct 7, 2023Updated 2 years ago
kpug / fpis
View on GitHub
☆12May 31, 2017Updated 9 years ago
Meteor-han / ReLMole
View on GitHub
☆14Jun 25, 2022Updated 4 years ago
ytaek-oh / retriever
View on GitHub
☆11Sep 15, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
saferlhf-v / saferlhf-v
View on GitHub
☆23Jun 16, 2025Updated last year
lscpku / VITATECS
View on GitHub
☆18Jul 10, 2024Updated 2 years ago
XuMengyaAmy / SwinMLP_TranCAP
View on GitHub
☆13Jun 26, 2022Updated 4 years ago
ykteh93 / Deep_Reinforcement_Learning-Atari
View on GitHub
Deep Q-Network (DQN) to play classic Atari Games
☆11Sep 18, 2017Updated 8 years ago
GauravGajbhiye / SCAMET_RSIC
View on GitHub
This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.
☆13Aug 10, 2023Updated 2 years ago
cleary-lab / CISI
View on GitHub
code for composite in situ imaging (cisi) analysis
☆12Oct 26, 2020Updated 5 years ago
TrilonIO / angular-universal-v9
View on GitHub
What's new with Angular Universal v9
☆18Feb 21, 2020Updated 6 years ago
xiaoxing2001 / DeGLA
View on GitHub
[ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]
☆16Jul 15, 2025Updated last year
thuhcsi / AdaMesh
View on GitHub
☆18Jun 14, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zjr2000 / Untrimmed-Video-Feature-Extractor
View on GitHub
A simple and effective feature extractor for untrimmed videos
☆13Sep 1, 2022Updated 3 years ago
GX77 / TextKG
View on GitHub
☆11Jun 27, 2023Updated 3 years ago
KoDohwan / VT-TWINS
View on GitHub
Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)
☆11Oct 12, 2022Updated 3 years ago
QiQAng / UEDVC
View on GitHub
☆12May 26, 2023Updated 3 years ago
3DHCG / Jittor_DiffPoseTalk
View on GitHub
Jittor implementation of DiffPoseTalk(SIGGRAPH 2024)
☆25Nov 11, 2024Updated last year
CAMMA-public / rendezvous-in-time
View on GitHub
rendezvous-in-time
☆14Sep 17, 2025Updated 10 months ago
BIGKnight / Understanding-Training-free-Diffusion-Guidance
View on GitHub
☆19Mar 18, 2024Updated 2 years ago
xmed-lab / DistillingSelf
View on GitHub
MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions
☆13Sep 17, 2022Updated 3 years ago
TeleeMa / SADE
View on GitHub
An Examination of the Compositionality of Large Generative Vision-Language Models
☆19Apr 9, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
XuMengyaAmy / CIDACaptioning
View on GitHub
☆17Jul 5, 2021Updated 5 years ago
loro-dev / loro-mirror
View on GitHub
☆25Jun 22, 2026Updated last month
ruotianluo / coco-caption
View on GitHub
☆67Nov 11, 2022Updated 3 years ago
appletea233 / Temporal-R1
View on GitHub
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency
☆62Jun 6, 2025Updated last year
YaoXie2 / Resnet50InCub200
View on GitHub
resnet50code
☆18Dec 29, 2024Updated last year
longbai1006 / CAT-ViL
View on GitHub
Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…
☆18Jul 7, 2024Updated 2 years ago
huabao97 / barcode-recognition
View on GitHub
一维条形码识别
☆15Oct 27, 2021Updated 4 years ago