A PyTorch reimplementation of bottom-up-attention models
☆16 · Jan 5, 2021 · Updated 5 years ago
Alternatives and similar repositories for bottom_up_features_extract
Users that are interested in bottom_up_features_extract are comparing it to the libraries listed below.
- A PyTorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning. ☆48 · Nov 15, 2021 · Updated 4 years ago
- A PyTorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral). ☆11 · Jul 18, 2022 · Updated 3 years ago
- A PyTorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering' ☆10 · Jan 20, 2020 · Updated 6 years ago
- A PyTorch implementation of Attention Is All You Need (Transformer) for image captioning. ☆12 · Nov 15, 2021 · Updated 4 years ago
- Reproduction of 'Weakly Supervised Coupled Networks for Visual Sentiment Analysis' ☆13 · Nov 7, 2019 · Updated 6 years ago
- ☆11 · Nov 13, 2024 · Updated last year
- Python code for CIDEr - Consensus-based Image Caption Evaluation ☆32 · Jun 25, 2019 · Updated 6 years ago
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021 ☆21 · Apr 5, 2022 · Updated 4 years ago
- PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" in NIPS 2020. ☆29 · Jan 26, 2021 · Updated 5 years ago
- A PyTorch reimplementation of bottom-up-attention models ☆301 · Apr 7, 2022 · Updated 4 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention ☆168 · Jan 6, 2019 · Updated 7 years ago
- Cross-modal Coherence Modeling for Caption Generation ☆11 · Jul 24, 2020 · Updated 5 years ago
- Tools to estimate the correlation of different text-based evaluation measures for Automatic Image Description ☆10 · Feb 2, 2017 · Updated 9 years ago
- Evaluating Visual Fidelity of Image Descriptions ☆11 · Aug 15, 2019 · Updated 6 years ago
- Image captioning with semantic attention ☆11 · Apr 1, 2017 · Updated 9 years ago
- ☆67 · Nov 11, 2022 · Updated 3 years ago
- VI-SVC model is just VITS without MAS and DurationPredictor. ☆10 · Nov 9, 2023 · Updated 2 years ago
- Implementation of GAMES202 homework in C++ and OpenGL ☆13 · Sep 14, 2022 · Updated 3 years ago
- ☆17 · Jun 23, 2022 · Updated 3 years ago
- Official code for the paper "MMA Regularization: Decorrelating Weights of Neural Networks by Maximizing the Minimal Angles" ☆13 · Oct 20, 2020 · Updated 5 years ago
- Connected Papers knockoff for managing academic papers and citations with a graph database. ☆12 · Dec 26, 2023 · Updated 2 years ago
- Extract features and bounding boxes using the original Bottom-up Attention Faster-RCNN in a few lines of Python code ☆11 · Sep 18, 2022 · Updated 3 years ago
- Clock recognition built with OpenCV, mainly using the Hough transform. ☆15 · Dec 13, 2014 · Updated 11 years ago
- Python 3 support for the MS COCO caption evaluation tools ☆14 · Jun 14, 2024 · Updated last year
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning ☆11 · Oct 29, 2024 · Updated last year
- Official code repository for the paper "Learn to Create Simple LEGO Micro Buildings" ☆19 · Jan 11, 2025 · Updated last year
- Apply methods described in the "Git Re-basin" paper [1] to arbitrary models. [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836) ☆15 · Apr 6, 2026 · Updated last week
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models". ☆12 · Oct 14, 2024 · Updated last year
- [NeurIPS 2024] "Membership Inference on Text-to-image Diffusion Models via Conditional Likelihood Discrepancy" ☆12 · Sep 15, 2025 · Updated 6 months ago
- Implementation of LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN ☆22 · Jul 11, 2023 · Updated 2 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021) ☆16 · Jan 2, 2023 · Updated 3 years ago
- This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language… ☆15 · Dec 1, 2023 · Updated 2 years ago
- ☆10 · Aug 28, 2020 · Updated 5 years ago
- ☆12 · Oct 30, 2022 · Updated 3 years ago
- ☆11 · Jan 19, 2025 · Updated last year
- Multimodal deep quality embedding network (MMDQEN) for affective video content analysis. (MM'19, TAFFC'20) ☆10 · Jul 24, 2021 · Updated 4 years ago
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass ☆50 · Mar 21, 2026 · Updated 3 weeks ago
- ☆21 · Aug 21, 2023 · Updated 2 years ago
- Code for GHA (ACCV2018) ☆13 · Oct 31, 2018 · Updated 7 years ago