hassanhub/R3Transformer

Official python implementation of R3-Transformer

☆15

Alternatives and similar repositories for R3Transformer

Users that are interested in R3Transformer are comparing it to the libraries listed below.

feichtenhofer / temporal-resnet
☆11 · Sep 15, 2017 · Updated 8 years ago
shengyuzhang / Poet
Poet: Product-oriented Video Captioner for E-commerce
☆12 · Sep 21, 2020 · Updated 5 years ago
mbforbes / physical-commonsense
Do Neural Language Representations Learn Physical Commonsense?
☆22 · Dec 28, 2021 · Updated 4 years ago
chihyaoma / cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46 · Jul 29, 2020 · Updated 5 years ago
MarcBS / TMA
Egocentric Video Description based on Temporally-Linked Sequences
☆11 · Jul 17, 2017 · Updated 9 years ago
Sha-Lab / CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16 · Apr 22, 2019 · Updated 7 years ago
frankxu2004 / cooking-procedural-extraction
☆19 · May 2, 2020 · Updated 6 years ago
dpfried / action-segmentation
Weakly-supervised action segmentation in video
☆16 · Feb 13, 2022 · Updated 4 years ago
microsoft / ExperimentTools
XTlib is an API and command line tool for scaling and managing ML experiments. The goal of XTLib is to enable you to effortlessly organi…
☆15 · Jul 5, 2023 · Updated 3 years ago
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
☆39 · Jun 1, 2021 · Updated 5 years ago
GingL / ARN
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
☆32 · Aug 29, 2019 · Updated 6 years ago
delchiaro / RATT
☆18 · Oct 3, 2023 · Updated 2 years ago
jamespark3922 / adv-inf
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34 · Jul 17, 2019 · Updated 7 years ago
ronghanghu / moco_v3_tpu
☆16 · Apr 10, 2022 · Updated 4 years ago
vsislab / Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68 · Nov 19, 2019 · Updated 6 years ago
UCSB-AI / Discffusion
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29 · Apr 27, 2024 · Updated 2 years ago
interactive-cookbook / ara
Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021
☆10 · May 22, 2024 · Updated 2 years ago
smallflyingpig / pytorch_video_caption
some models for video caption implemented by pytorch. (S2VT)
☆23 · Feb 1, 2018 · Updated 8 years ago
siliu-group / pic-challenge
PIC API
☆25 · Sep 18, 2019 · Updated 6 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 · Mar 4, 2020 · Updated 6 years ago
gsig / visual-grounding
Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020
☆43 · Apr 26, 2020 · Updated 6 years ago
ck0123 / improved-bertscore-for-image-captioning-evaluation
☆21 · Jul 25, 2024 · Updated 2 years ago
facebookresearch / binary-image-selection
BISON: Binary Image SelectiON
☆50 · Sep 15, 2021 · Updated 4 years ago
LuoweiZhou / anet2016-cuhk-feature
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32 · Mar 31, 2019 · Updated 7 years ago
jayleicn / recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170 · Dec 4, 2020 · Updated 5 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 4 months ago
xiadingZ / video-caption-openNMT.pytorch
implement video caption based on openNMT
☆36 · Apr 19, 2018 · Updated 8 years ago
shengyuzhang / VideoTitling
Comprehensive Information Integration Modeling Framework for Video Titling
☆11 · Aug 27, 2020 · Updated 5 years ago
KaihuaTang / VCTree-Scene-Graph-Generation
Code for the Scene Graph Generation part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"
☆124 · Jan 6, 2026 · Updated 6 months ago
simon-ging / coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291 · Sep 6, 2022 · Updated 3 years ago
doubledaibo / 2dcaption_eccv2018
Rethinking the Form of Latent States in Image Captioning
☆20 · Aug 31, 2018 · Updated 7 years ago
Pratik08 / Vis-DSS
☆12 · Dec 9, 2018 · Updated 7 years ago
antoine77340 / MIL-NCE_HowTo100M
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221 · Jul 5, 2022 · Updated 4 years ago
ccvl / iep-ref
Inferring and Executing Programs for Visual Reasoning
☆21 · Jan 4, 2019 · Updated 7 years ago
willieneis / ProBO
ProBO: Versatile Bayesian Optimization Using Any Probabilistic Programming Language
☆16 · Jul 4, 2019 · Updated 7 years ago
groverjeenu / Bilingual-Word-Embeddings-with-Bucketed-CNN-for-Parallel-Sentence-Extraction
Code for our paper in ACL 2017
☆13 · Dec 14, 2017 · Updated 8 years ago
tomekkorbak / treehopper
A Tree-LSTM-based dependency tree sentiment labeler
☆15 · May 9, 2019 · Updated 7 years ago
soCzech / ChangeIt
ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022
☆11 · Mar 23, 2022 · Updated 4 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago