junyangwang0410/Knight

SotA text-only image/video method (IJCAI 2023)

☆15

Alternatives and similar repositories for Knight

Users who are interested in Knight are comparing it to the repositories listed below.

ml-jku / semantic-image-text-alignment
☆25 • Jul 10, 2023 • Updated 3 years ago
RitaRamo / extra
Retrieval-augmented Image Captioning
☆13 • Feb 16, 2023 • Updated 3 years ago
boreng0817 / IFCap
[EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
☆15 • May 13, 2025 • Updated last year
iOPENCap / awesome-unimodal-training
text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)
☆12 • Oct 15, 2024 • Updated last year
Lihr747 / CgtGAN
☆20 • May 3, 2025 • Updated last year
yuhui-zh15 / C3
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆36 • Oct 16, 2024 • Updated last year
DavidHuji / CapDec
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
☆209 • Jan 28, 2024 • Updated 2 years ago
taewhankim / VIPCAP
☆15 • Dec 31, 2024 • Updated last year
amazon-science / camml
CaMML: Context-Aware MultiModal Learner for Large Models (ACL 2024 SAC Award)
☆15 • May 21, 2025 • Updated last year
allenai / close
☆59 • Aug 30, 2023 • Updated 2 years ago
HITsz-TMG / Cognitive-Visual-Language-Mapper
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17 • Jan 24, 2025 • Updated last year
Sha-Lab / CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16 • Apr 22, 2019 • Updated 7 years ago
Ivy-zoe / script
same script
☆12 • Nov 25, 2019 • Updated 6 years ago
crigaud / publication
Publication sources, algorithm, code, result, conference poster, scientific paper for ICDAR, CIFED, VISAPP
☆15 • Jul 8, 2022 • Updated 4 years ago
Yushi-Hu / PromptCap
natural language guided image captioning
☆89 • Feb 11, 2024 • Updated 2 years ago
ChineseYjh / RAF-DB-baselines
Implementation of the CVPR'17 paper, Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild
☆13 • Sep 28, 2022 • Updated 3 years ago
orensul / analogies_mining
☆21 • Mar 19, 2024 • Updated 2 years ago
ForrestPi / ObjectDetection
some object detection algo
☆14 • Jul 25, 2024 • Updated last year
feizc / DeeCap
Dynamic Early Exit for Image Captioning
☆17 • Oct 25, 2022 • Updated 3 years ago
joeyz0z / MeaCap
(CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning
☆56 • Aug 16, 2024 • Updated last year
SinHanYang / Dual-CAN
Entity-Aware Dual Co-Attention Network for Fake News Detection, EACL 2023 Findings
☆10 • Jun 11, 2023 • Updated 3 years ago
DavidMChan / caption-by-committee
Using LLMs and pre-trained caption models for super-human performance on image captioning.
☆42 • Oct 13, 2023 • Updated 2 years ago
liupeng0606 / clip4caption
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆16 • Jan 2, 2023 • Updated 3 years ago
MarcusNerva / HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆50 • Sep 30, 2022 • Updated 3 years ago
FuxiaoLiu / Twitter-Video-dataset
[EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms
☆12 • Sep 26, 2023 • Updated 2 years ago
YiwuZhong / SGG_from_NLS
[ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"
☆100 • Apr 4, 2023 • Updated 3 years ago
ylingfeng / FGVP
Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023
☆57 • Feb 1, 2024 • Updated 2 years ago
Hassi34 / poker-hand-detection
Poker Hand Detection Using Yolov8
☆15 • Feb 26, 2023 • Updated 3 years ago
AHandsomePython / MSMedCap
Code for Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning
☆16 • Apr 5, 2024 • Updated 2 years ago
VirajBagal / MMBERT
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
☆39 • Mar 22, 2021 • Updated 5 years ago
seungjooli / ConditionalGAN
Convert sketch to face image using Conditional Adversarial Nets (https://phillipi.github.io/pix2pix/)
☆17 • May 31, 2017 • Updated 9 years ago
Sixzeroo / HFUTXCNewsNotifications
Monitors the official website of Hefei University of Technology, Xuancheng Campus, for changes to notices and sends email notifications
☆12 • Jun 1, 2021 • Updated 5 years ago
ahmedssabir / Belief-Revision-Score
Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022
☆11 • Apr 13, 2025 • Updated last year
LuoweiZhou / coco-caption
kdexd/coco-caption@de6f385
☆26 • Apr 21, 2020 • Updated 6 years ago
CUMTGG / CIIC
☆18 • Sep 13, 2023 • Updated 2 years ago
starreeze / efuf
the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…
☆21 • Apr 9, 2025 • Updated last year
YoadTew / zero-shot-image-to-text
Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
☆279 • Sep 17, 2022 • Updated 3 years ago
rucinfo-Tiffany / LDA_TopicModeling
Latent Dirichlet allocation using Sklearn
☆18 • Aug 6, 2018 • Updated 7 years ago
Lieberk / UNETR
An implementation of 'UNETR: Transformers for 3D Medical Image Segmentation' based on PaddlePaddle
☆21 • Jun 7, 2022 • Updated 4 years ago