A PyTorch reimplementation of bottom-up-attention models
☆16Jan 5, 2021Updated 5 years ago
Alternatives and similar repositories for bottom_up_features_extract
Users who are interested in bottom_up_features_extract are comparing it to the libraries listed below.
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆48Nov 15, 2021Updated 4 years ago
- A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).☆11Jul 18, 2022Updated 3 years ago
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'☆10Jan 20, 2020Updated 6 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.☆12Nov 15, 2021Updated 4 years ago
- Reproduction of 'Weakly Supervised Coupled Networks for Visual Sentiment Analysis'☆13Nov 7, 2019Updated 6 years ago
- ☆11Nov 13, 2024Updated last year
- ☆16Jul 10, 2024Updated last year
- A PyTorch implementation of paper "Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering"☆11Aug 25, 2020Updated 5 years ago
- PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" in NIPS 2020.☆29Jan 26, 2021Updated 5 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- A PyTorch reimplementation of bottom-up-attention models☆301Apr 7, 2022Updated 4 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention☆168Jan 6, 2019Updated 7 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- Evaluating Visual Fidelity of Image Descriptions☆11Aug 15, 2019Updated 6 years ago
- Image captioning with semantic attention☆11Apr 1, 2017Updated 9 years ago
- A codebase for multi-label classification with PyTorch.☆15Nov 23, 2022Updated 3 years ago
- ☆67Nov 11, 2022Updated 3 years ago
- An official codebase of Two-Stream Transformer for Multi-Label Image Classification, ACMMM 2022.☆17May 6, 2024Updated last year
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- official code for paper "MMA Regularization: Decorrelating Weights of Neural Networks by Maximizing the Minimal Angles"☆13Oct 20, 2020Updated 5 years ago
- Extract features and bounding boxes using the original Bottom-up Attention Faster-RCNN in a few lines of Python code☆11Sep 18, 2022Updated 3 years ago
- Clock recognition built with OpenCV, mainly using the Hough transform.☆15Dec 13, 2014Updated 11 years ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)☆15Apr 27, 2026Updated last week
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)☆47Mar 21, 2024Updated 2 years ago
- ☆24Dec 22, 2016Updated 9 years ago
- [NeurIPS 2024] "Membership Inference on Text-to-image Diffusion Models via Conditional Likelihood Discrepancy"☆12Sep 15, 2025Updated 7 months ago
- Implementation of LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN☆22Jul 11, 2023Updated 2 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- Multimodal deep quality embedding network (MMDQEN) for affective video content analysis. (MM'19, TAFFC'20)☆10Jul 24, 2021Updated 4 years ago
- Resnet-34 with light-weighted decoder for pose estimation. (Pytorch)☆21Sep 21, 2018Updated 7 years ago
- Image captioning with Transformer☆14Oct 11, 2021Updated 4 years ago
- ☆64Jan 5, 2022Updated 4 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 3 years ago
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m…☆67Jun 11, 2024Updated last year
- ☆11Mar 19, 2023Updated 3 years ago
- A simple wrapper for lmdb. Support dict-like operations.☆23Apr 20, 2023Updated 3 years ago
- Implementation of CapsNet from the paper Dynamic Routing Between Capsules☆10Nov 7, 2017Updated 8 years ago