zchoi/SPT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zchoi/SPT)

zchoi / SPT

[TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".

☆10

Alternatives and similar repositories for SPT

Users that are interested in SPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
zchoi / PKOL
View on GitHub
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
☆46Jan 27, 2024Updated 2 years ago
zchoi / S2-Transformer
View on GitHub
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
☆86Aug 14, 2024Updated last year
RainBowLuoCS / MMEvol
View on GitHub
(ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"
☆22May 15, 2025Updated last year
kaipengfang / ProS
View on GitHub
☆19Jul 22, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
marco-garosi / ComCa
View on GitHub
Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection"
☆23Dec 23, 2024Updated last year
ExcaliburEX / SMMCC
View on GitHub
🌠用PySimpleGUI实现一个简易的分布式计算系统——简易多机协同计算原型系统(Simply Multi-Machine Collaborative Computing)
☆11May 26, 2020Updated 6 years ago
InstLatx64 / AVX512_PopCnt
View on GitHub
AVX512 population count routines
☆23Aug 2, 2019Updated 6 years ago
yyyanglz / KAN
View on GitHub
Rich Visual Knowledge-based AugmentationNetwork for Visual Question Answering
☆10Dec 6, 2019Updated 6 years ago
shenxiang-vqa / LSAT
View on GitHub
Local self-attention in Transformer for visual question answering
☆13Mar 17, 2024Updated 2 years ago
suny-sht / clip-red-circle
View on GitHub
Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023
☆12Sep 21, 2023Updated 2 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
View on GitHub
Extension of hLSTMat
☆19Apr 15, 2021Updated 5 years ago
AmeenAli / VideoMatch
View on GitHub
☆14Jan 5, 2022Updated 4 years ago
aa200647963 / SGG-DHL
View on GitHub
This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.
☆17Aug 6, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
JoannaRay1 / project-xiechengHotel-crawl-analysis
View on GitHub
实现对携程网站的酒店评论爬取，并进行数据预处理和基于情感分类的数据分析，使用了jieba评论分词等处理技术，情感词典，特征值提取，机器学习模型等分析预测技术，词云，热力图等可视化技术
☆13Jul 15, 2022Updated 4 years ago
VL-Group / DPQ
View on GitHub
☆19Dec 16, 2020Updated 5 years ago
NovaMind-Z / PTSN
View on GitHub
Repository for an end-to-end image captioning method PTSN(ACM MM22).
☆60Dec 11, 2022Updated 3 years ago
hpcaitech / GPT-Demo
View on GitHub
GPT Demo with hybrid distributed training
☆10Dec 1, 2022Updated 3 years ago
ZhuGeKongKong / SGG-G2S
View on GitHub
☆21Mar 1, 2022Updated 4 years ago
avcourt / spamfilter-py
View on GitHub
A naïve Bayesian spam filter in Python
☆10Dec 18, 2019Updated 6 years ago
gouzigouzi / attention-residuals-for-chinese-llms
View on GitHub
A Chinese-focused PyTorch framework for exploring Attention Residuals in Qwen3-style causal LMs, with baseline, Block AttnRes, Full AttnR…
☆19May 3, 2026Updated 2 months ago
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
YangYY-Liu / MatrixChatGPTVoiceBot
View on GitHub
Talk to ChatGPT and Generate image via any Matrix client!
☆16Apr 25, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
EchoSafe-MLLM / EchoSafe
View on GitHub
[CVPR 2026] Code for Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
☆15Mar 18, 2026Updated 4 months ago
jd-opensource / Citrus-V
View on GitHub
Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning
☆17Sep 25, 2025Updated 9 months ago
ninatu / in_style
View on GitHub
Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023
☆11Oct 5, 2023Updated 2 years ago
renfei / SpringCloudDemo
View on GitHub
SpringCloud微服务入门教程，包含Eureka注册发现、Config配置中心、BUS消息总线、FeignClient客户端、Zuul网关、Hystrix服务熔断降级、Stream消息队列、Sleuth链路监控、Swagger文档的基本整合演示。
☆11Aug 26, 2024Updated last year
RongKaiWeskerMA / INSTA
View on GitHub
The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning
☆13Apr 14, 2024Updated 2 years ago
AISG-Technology-Team / GCSS-Track-1A-Submission-Guide
View on GitHub
Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).
☆16Jul 4, 2024Updated 2 years ago
wqshmzh / CANet-CZSL
View on GitHub
Official pytorch implementation of CVPR2023 paper "Learning Conditional Attributes for Compositional Zero-Shot Learning"
☆18Oct 19, 2025Updated 9 months ago
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
xiaoneil / LPNet
View on GitHub
☆13Nov 28, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
VL-Group / PENET
View on GitHub
[CVPR 2023]Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"
☆62Jun 8, 2023Updated 3 years ago
youngwanLEE / holisafe
View on GitHub
[CVPR Findings 2026] HoliSafe: Holistic Safety Benchmarking and Modeling for Vision-Language Model
☆17Mar 8, 2026Updated 4 months ago
zchoi / GLSCL
View on GitHub
[TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"
☆16May 12, 2025Updated last year
XinyuLyu / FGPL
View on GitHub
This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)".
☆26Jun 7, 2024Updated 2 years ago
xiaosu-zhu / Aurora-Weather
View on GitHub
Aurora Weather
☆24Dec 8, 2016Updated 9 years ago
zhexu1997 / HiSA
View on GitHub
☆10Aug 21, 2022Updated 3 years ago
gordonjun2 / Naturalistic-Adversarial-Patch
View on GitHub
ICCV 2021
☆14Oct 6, 2021Updated 4 years ago