weimingboya/DFT

☆13

Alternatives and similar repositories for DFT

Users who are interested in DFT are comparing it to the libraries listed below.

aimagelab / PMA-Net
[ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
☆19 · Jun 7, 2024 · Updated 2 years ago
xmu-xiaoma666 / LSTNet
Towards Local Visual Modeling for Image Captioning
☆30 · Mar 31, 2023 · Updated 3 years ago
mrwu-mac / DIFNet
[CVPR 2022] This repository is for the paper "DIFNet: Boosting Visual Information Flow for Image Captioning".
☆21 · Nov 28, 2022 · Updated 3 years ago
232525 / PureT
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
☆70 · Jun 1, 2024 · Updated 2 years ago
quangvnai / grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199 · May 9, 2023 · Updated 3 years ago
zchoi / S2-Transformer
[IJCAI 2022] Official PyTorch code for the paper “S2 Transformer for Image Captioning”
☆86 · Aug 14, 2024 · Updated last year
SjokerLily / awesome-image-captioning
A paper list of image captioning.
☆21 · Apr 23, 2022 · Updated 4 years ago
GT-RIPL / Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …
☆61 · Oct 21, 2022 · Updated 3 years ago
hfutmars / MGCL
The complete code for the paper "Multimodal Graph Contrastive Learning for Recommendation"
☆10 · Mar 20, 2023 · Updated 3 years ago
FeiElysia / ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
☆167 · Sep 9, 2024 · Updated last year
zhyfzy / SRCNN-Android
An implementation of image super-resolution using the SRCNN model on the Android platform
☆15 · Oct 14, 2017 · Updated 8 years ago
smileslabsh / Generative-Label-Fused-Network
Generative label fused network for image–text matching
☆10 · Jan 13, 2023 · Updated 3 years ago
Jiaxuan-Li / EVCap
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
☆64 · Apr 8, 2024 · Updated 2 years ago
ZhangXu0963 / VSL
Code for "Image-text Retrieval via Preserving Main Semantic of Vision" (ICME 2023).
☆15 · Dec 25, 2023 · Updated 2 years ago
zhangy0822 / USER
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33 · Jun 18, 2025 · Updated last year
Qrange-group / Mirror-Gradient
WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima more easily compared to models with normal trai…
☆17 · Nov 1, 2024 · Updated last year
husthuaan / AAT
Code for the paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50 · Dec 18, 2019 · Updated 6 years ago
Arthurizijar / machine-translation
This project is the major course assignment for the Beihang University Natural Language Processing course, fall 2020.
☆10 · Dec 28, 2020 · Updated 5 years ago
fqldom / BeFA
BeFA: A General Behavior-driven Feature Adapter for Multimedia Recommendation
☆13 · Feb 21, 2025 · Updated last year
80chen86 / IPDN
☆17 · Dec 25, 2025 · Updated 6 months ago
guojin-yan / PaddleInferenceSharp
☆20 · Nov 4, 2022 · Updated 3 years ago
Neon-Jing / Guider
[WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…
☆14 · Oct 14, 2025 · Updated 9 months ago
NanGongNingYi / GUME
☆16 · Jul 19, 2024 · Updated 2 years ago
jianjieluo / SCD-Net
[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m…
☆68 · Jun 11, 2024 · Updated 2 years ago
voidism / ML2018SPRING
NTUEE 2018 spring course - Machine Learning (Pei-Yuan Wu, Hung-Yi Lee, Tsungnan Lin)
☆12 · Jul 22, 2018 · Updated 8 years ago
andyweizhao / Multitask_Image_Captioning
☆23 · Aug 18, 2018 · Updated 7 years ago
jongwook / crepe
☆12 · Jun 5, 2018 · Updated 8 years ago
tany0699 / FMViT
☆31 · Aug 3, 2023 · Updated 2 years ago
abdfahim / audioprocessing
Standard libraries for audio processing, especially STFT and Spherical Harmonics decomposition of a soundfield.
☆10 · Nov 29, 2021 · Updated 4 years ago
qzp2018 / MCLN
This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…
☆27 · Oct 10, 2024 · Updated last year
YuanEZhou / satic
☆26 · Jun 25, 2021 · Updated 5 years ago
liranhao / recognition_qrcode
Recognizes QR codes in images with Flutter
☆18 · Aug 22, 2025 · Updated 11 months ago
CRIPAC-DIG / Balanced-Multimodal-Rec
☆15 · Mar 11, 2025 · Updated last year
davidsvy / hard-negative-mixing
An unofficial PyTorch implementation of the NeurIPS 2020 paper Hard Negative Mixing for Contrastive Learning.
☆20 · Oct 17, 2022 · Updated 3 years ago
cswhjiang / Recurrent_Fusion_Network
Source code for "Recurrent Fusion Network for Image Captioning".
☆23 · Nov 24, 2018 · Updated 7 years ago
lbtrace / Waifu2xAndroid
A waifu2x implementation for Android, based on TensorFlow for mobile
☆25 · Apr 30, 2018 · Updated 8 years ago
ZhenhHuang / MSG
[NeurIPS 2024] Spiking Graph Neural Networks on Riemannian Manifolds
☆23 · Oct 22, 2024 · Updated last year
yangdongchao / Tim-TSENet
The source code of Tim-TSENet
☆15 · Apr 22, 2022 · Updated 4 years ago
FeiElysia / awesome-zero-shot-captioning
A curated list of zero-shot captioning papers
☆24 · Aug 26, 2023 · Updated 2 years ago