shengyuzhang/Poet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shengyuzhang/Poet)

shengyuzhang / Poet

Poet: Product-oriented Video Captioner for E-commerce

☆12

Alternatives and similar repositories for Poet

Users that are interested in Poet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shengyuzhang / VideoTitling
View on GitHub
Comprehensive Information Integration Modeling Framework for Video Titling
☆11Aug 27, 2020Updated 5 years ago
visinf / cos-cvae
View on GitHub
Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)
☆37May 16, 2022Updated 4 years ago
xiadingZ / video-caption-openNMT.pytorch
View on GitHub
implement video caption based on openNMT
☆36Apr 19, 2018Updated 8 years ago
hobincar / SGN
View on GitHub
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Jul 9, 2021Updated 5 years ago
hassanhub / R3Transformer
View on GitHub
Official python implementation of R3-Transformer
☆15Nov 30, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
View on GitHub
Source code for Delving Deeper into the Decoder for Video Captioning
☆39Jun 1, 2021Updated 5 years ago
vsislab / Controllable_XGating
View on GitHub
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Nov 19, 2019Updated 6 years ago
chenchy / D3Net
View on GitHub
A pytorch implementation of D3Net.
☆11Aug 8, 2021Updated 4 years ago
eric-xw / kinetics-i3d-pytorch
View on GitHub
☆35Mar 22, 2019Updated 7 years ago
ramakanth-pasunuru / video_captioning_rl
View on GitHub
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43Nov 19, 2019Updated 6 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
View on GitHub
Extension of hLSTMat
☆19Apr 15, 2021Updated 5 years ago
aimagelab / mvad-names-dataset
View on GitHub
M-VAD Names Dataset. Multimedia Tools and Applications (2019)
☆24Jul 9, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vBaiCai / vc_tacotron
View on GitHub
Voice Conversion using Tacotron.
☆11Dec 29, 2022Updated 3 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
yikang-li / vg_cleansing
View on GitHub
dataset cleansing for Visual Genome
☆30Apr 26, 2017Updated 9 years ago
syuqings / video-paragraph
View on GitHub
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆66Oct 21, 2021Updated 4 years ago
smallflyingpig / pytorch_video_caption
View on GitHub
some models for video caption implemented by pytorch. (S2VT)
☆23Feb 1, 2018Updated 8 years ago
MILVLG / mt-captioning
View on GitHub
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Sep 4, 2020Updated 5 years ago
chitwansaharia / HACAModel
View on GitHub
Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…
☆26Nov 3, 2018Updated 7 years ago
Kizuna-AII / Realm-Before-the-Omniscience
View on GitHub
Project for ZJU-Game-2021
☆10Sep 20, 2021Updated 4 years ago
WingsBrokenAngel / Semantics-AssistedVideoCaptioning
View on GitHub
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
☆55Jul 31, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mynlp / cst_captioning
View on GitHub
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
☆60May 15, 2018Updated 8 years ago
DeerSheep0314 / Re4-Learning-to-Re-contrast-Re-attend-Re-construct-for-Multi-interest-Recommendation
View on GitHub
☆23Aug 4, 2022Updated 3 years ago
WelkinYang / EMPHASIS-pytorch
View on GitHub
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15Mar 31, 2019Updated 7 years ago
jssprz / visual_syntactic_embedding_video_captioning
View on GitHub
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
☆30Apr 16, 2021Updated 5 years ago
NYElegance / SimulLR
View on GitHub
PyTorch Implementation of SimulLR
☆11Dec 30, 2021Updated 4 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 4 years ago
kreimanlab / DeepLearning-vs-HighLevelVision
View on GitHub
Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities?
☆12Dec 22, 2020Updated 5 years ago
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
hobincar / pytorch-video-feature-extractor
View on GitHub
A repository for extract CNN features from videos using pytorch
☆70Nov 22, 2022Updated 3 years ago
iriscxy / VMSMO
View on GitHub
Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''
☆36Jul 30, 2021Updated 4 years ago
Sundrops / video-caption.pytorch
View on GitHub
☆33Apr 20, 2018Updated 8 years ago
hhuang-code / VideoSM
View on GitHub
Video Summarization (Attention Mechanism and Hierarchical LSTM)
☆31Feb 14, 2018Updated 8 years ago
Pratik08 / Vis-DSS
View on GitHub
☆12Dec 9, 2018Updated 7 years ago
morikatron / yakinori
View on GitHub
Japanese Converter Kanji to Hiragana, Katakana, Roma-ji
☆13Jul 19, 2023Updated 3 years ago