A lightweight deep learning model, with a web application, that answers image-based questions using a non-generative approach for the VizWiz Grand Challenge 2023. It carefully curates the answer vocabulary and adds a linear layer on top of OpenAI's CLIP model, which serves as the image and text encoder.
☆14Jun 27, 2023Updated 2 years ago
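The non-generative approach described above can be read as classification over a fixed answer vocabulary. The following is a minimal sketch of that idea, not the repository's actual code: the embeddings are random placeholders standing in for CLIP image and text features, the concatenation of the two embeddings is an assumption, and the helper names `build_answer_vocab`, `linear_head`, and `predict` are hypothetical.

```python
import random
from collections import Counter

def build_answer_vocab(answer_lists, max_size):
    """Take each question's majority answer, then keep the most frequent ones as classes."""
    majority = [Counter(answers).most_common(1)[0][0] for answers in answer_lists]
    return [ans for ans, _ in Counter(majority).most_common(max_size)]

def linear_head(features, weights, bias):
    """One linear layer: produce one score per vocabulary entry."""
    return [sum(w * x for w, x in zip(row, features)) + b
            for row, b in zip(weights, bias)]

def predict(image_emb, text_emb, weights, bias, vocab):
    """Concatenate both embeddings (an assumption), score every answer, return the best."""
    scores = linear_head(image_emb + text_emb, weights, bias)
    return vocab[max(range(len(vocab)), key=scores.__getitem__)]

# Hypothetical crowd answers for three questions (illustrative, not dataset samples).
answers = [["red", "red", "maroon"], ["unanswerable"] * 3, ["red", "crimson", "red"]]
vocab = build_answer_vocab(answers, max_size=2)

# Placeholder embeddings and untrained weights; a real system would use
# CLIP encoder outputs and a trained layer instead.
rng = random.Random(0)
dim = 4
image_emb = [rng.uniform(-1, 1) for _ in range(dim)]
text_emb = [rng.uniform(-1, 1) for _ in range(dim)]
weights = [[rng.uniform(-1, 1) for _ in range(2 * dim)] for _ in vocab]
bias = [0.0 for _ in vocab]

answer = predict(image_emb, text_emb, weights, bias, vocab)
print(answer)
```

Because the model only ever selects from `vocab`, the quality of the curated vocabulary bounds what it can answer, which is presumably why the description stresses careful curation.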
Alternatives and similar repositories for Visual-Question-Answering
Users that are interested in Visual-Question-Answering are comparing it to the libraries listed below.
- ☆23Aug 9, 2021Updated 4 years ago
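- Spring Cloud microservices beginner tutorial, with basic integration demos of Eureka service registration and discovery, Config configuration center, Bus message bus, FeignClient client, Zuul gateway, Hystrix circuit breaking and degradation, Stream message queue, Sleuth trace monitoring, and Swagger documentation.☆11Aug 26, 2024Updated last year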
- ☆12Nov 15, 2022Updated 3 years ago
- Acceleration-friendly component architecture framework☆19Feb 20, 2026Updated last month
- Port of YOLOv4 to C# + TensorFlow☆12Dec 29, 2020Updated 5 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- ☆12Jan 5, 2023Updated 3 years ago
- Code repository for Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering☆19Oct 30, 2021Updated 4 years ago
- The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization.☆12Dec 25, 2021Updated 4 years ago
- A library for automatically extracting color palettes from images☆10Mar 20, 2016Updated 10 years ago
- 2020 Tencent Advertising Algorithm Competition solution write-up and code (champion)☆14May 1, 2023Updated 2 years ago
- A Persian Image Captioning model based on Vision Encoder Decoder Models of the transformers🤗.☆20Feb 27, 2022Updated 4 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆17Dec 20, 2022Updated 3 years ago
- ☆12Jan 4, 2022Updated 4 years ago
- Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with Interpretability, IEEE TMM☆15Jun 3, 2025Updated 9 months ago
- Official implementation of 'P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering'. (Accepted by ICLR 2024)☆18Jan 19, 2024Updated 2 years ago
- The code for the ACM Multimedia 2023 paper 'DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting'☆13Jan 12, 2024Updated 2 years ago
- PyTorch implementation of SuperPoint https://arxiv.org/abs/1712.07629☆10Mar 15, 2021Updated 5 years ago
- A PyTorch implementation of transformer for text generation.☆16Mar 17, 2019Updated 7 years ago
- Matlab code for fast Hausdorff distance for binary images or segmentation maps☆11Mar 10, 2019Updated 7 years ago
- ☆11Nov 26, 2025Updated 4 months ago
- Triplet neural network for joint representation learning for text and images☆10Mar 17, 2019Updated 7 years ago
- ☆21Oct 3, 2023Updated 2 years ago
- C++ deployment of YOLOv8 with TensorRT.☆16Oct 29, 2024Updated last year
- ☆16Mar 22, 2024Updated 2 years ago
- A lightweight solution that everyone can understand☆16Jul 10, 2020Updated 5 years ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆17Aug 26, 2023Updated 2 years ago
- Short paper to Medical Imaging with Deep Learning 2023 (#MIDL2023) > https://arxiv.org/abs/2304.03941☆12Jul 17, 2023Updated 2 years ago
- Triplet loss utility library for PyTorch.☆13Jul 25, 2024Updated last year
- Official code for the ECCV 2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"☆25Oct 1, 2024Updated last year
- ☆17Nov 1, 2023Updated 2 years ago
- Hyperparameter Bayesian Optimization for Image Classification in PyTorch☆12Aug 20, 2019Updated 6 years ago
- Recommender system☆13Oct 4, 2022Updated 3 years ago
- TIP: Bi-directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification☆22Mar 29, 2021Updated 4 years ago
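- Introduction to C++11 multithreading☆12Apr 24, 2019Updated 6 years ago
- Tsinghua University School of Software: Data Structures C++/Qt course project☆12Sep 2, 2020Updated 5 years ago
- CMU 15-441 Project 1: Liso web server☆17Nov 4, 2018Updated 7 years ago
- WeChat Big Data Challenge 2021☆18Sep 6, 2021Updated 4 years ago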
- ☆13Dec 16, 2022Updated 3 years ago