A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder
☆14Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for Visual-Question-Answering
Users that are interested in Visual-Question-Answering are comparing it to the libraries listed below
Sorting:
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆15Dec 12, 2023Updated 2 years ago
- ☆23Aug 9, 2021Updated 4 years ago
- Pytorch implementation of Superpoint https://arxiv.org/abs/1712.07629☆10Mar 15, 2021Updated 4 years ago
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 2 years ago
- Matlab code for fast Hausdorff distance for binary images or segmentation maps☆11Mar 10, 2019Updated 6 years ago
- 2021 QQ浏览器ai算法大赛 赛道一 决赛第17名☆17Oct 25, 2022Updated 3 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- Machine learning bootcamp https://ztlevi.gitbook.io/ml-101/☆14Jul 22, 2023Updated 2 years ago
- 2020腾讯广告算法大赛方案分享及代码(冠军)☆13May 1, 2023Updated 2 years ago
- Triplet Loss Utility for Pytorch Library.☆13Jul 25, 2024Updated last year
- Multicultural Proverbs and Sayings☆12Jan 11, 2025Updated last year
- Image Segmentation using Fully Convolutional Networks in PyTorch☆11May 16, 2019Updated 6 years ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆17Aug 26, 2023Updated 2 years ago
- The codes for ACM Multimedia 2023 paper 'DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting. '☆13Jan 12, 2024Updated 2 years ago
- "Fair Federated AI" Summer School, July 19-21, 2024; 16:00 — 19:30 CET time☆13Aug 19, 2024Updated last year
- 中科大2022春《深度学习导论》课程资源☆10Aug 7, 2022Updated 3 years ago
- ☆12Jan 5, 2023Updated 3 years ago
- Short paper to Medical Imaging with Deep Learning 2023 (#MIDL2023) > https://arxiv.org/abs/2304.03941☆12Jul 17, 2023Updated 2 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- Hyperpatameter Bayesian Optimization for Image Classification in PyTorch☆12Aug 20, 2019Updated 6 years ago
- ☆12Feb 25, 2026Updated last week
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆17Dec 20, 2022Updated 3 years ago
- Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with Interpretability, IEEE TMM☆15Jun 3, 2025Updated 9 months ago
- Basic implementation of a Siamese network for face similarity using PyTorch☆12Jul 22, 2023Updated 2 years ago
- Reproducible code for Augmentation paper☆17Jan 23, 2019Updated 7 years ago
- 中科大2021秋《机器学习概论》课程资源☆10Jan 31, 2022Updated 4 years ago
- Implementation of Practical Facial Landmark Detector (PFLD) on Pytorch☆14Jul 23, 2023Updated 2 years ago
- ☆15Jan 28, 2021Updated 5 years ago
- PyTorch implementation of Original UNet Paper☆13Dec 13, 2020Updated 5 years ago
- ☆11Nov 26, 2025Updated 3 months ago
- 基于多模态检索的互联网图文匹配☆15Mar 17, 2024Updated last year
- Alpha version of our data-centric visual benchmark for training data selection☆16Aug 28, 2023Updated 2 years ago
- yolov8 tensorRT 的 C++部署。☆16Oct 29, 2024Updated last year
- ☆17Nov 1, 2023Updated 2 years ago
- This repository contains all code to support the paper: "Deep Learning for Detection and Localization of B-Lines in Lung Ultrasound".☆18Jul 25, 2024Updated last year
- 电商广告推荐系统☆14Jun 3, 2022Updated 3 years ago
- C++11多线程入门☆12Apr 24, 2019Updated 6 years ago
- ☆17Mar 31, 2020Updated 5 years ago
- 淘宝用户行为数据案例分析☆17May 18, 2020Updated 5 years ago