A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach for the VizWiz Grand Challenge 2023. It carefully curates the answer vocabulary and adds a linear layer on top of OpenAI's CLIP model, which serves as the image and text encoder.
☆14Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for Visual-Question-Answering
Users who are interested in Visual-Question-Answering are comparing it to the libraries listed below.
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆15Dec 12, 2023Updated 2 years ago
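- An introductory Spring Cloud microservices tutorial with basic integration demos of Eureka service registration and discovery, the Config configuration center, the Bus message bus, FeignClient, the Zuul gateway, Hystrix circuit breaking and fallback, Stream message queues, Sleuth tracing, and Swagger documentation.☆11Aug 26, 2024Updated last year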
- ☆13Nov 15, 2022Updated 3 years ago
- Calculates the output shape of PyTorch operations☆15May 30, 2023Updated 2 years ago
- ☆12Jan 5, 2023Updated 3 years ago
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 3 years ago
- The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization.☆12Dec 25, 2021Updated 4 years ago
- Code repository for Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering☆19Oct 30, 2021Updated 4 years ago
- Top-7 solution for the iFLYTEK offline sales volume challenge☆13Aug 21, 2021Updated 4 years ago
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- Solution write-up and code for the 2020 Tencent Advertising Algorithm Competition (champion)☆14May 1, 2023Updated 3 years ago
- A Persian Image Captioning model based on Vision Encoder Decoder Models of the transformers🤗.☆20Feb 27, 2022Updated 4 years ago
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆11May 10, 2024Updated 2 years ago
- ☆12Jan 4, 2022Updated 4 years ago
- Website source code for the USTC network security lab☆11Sep 1, 2022Updated 3 years ago
- Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with Interpretability, IEEE TMM☆15Jun 3, 2025Updated 11 months ago
- Official implementation of 'P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering'. (Accepted by ICLR 2024)☆18Jan 19, 2024Updated 2 years ago
- Code for the ACM Multimedia 2023 paper 'DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting'☆13Jan 12, 2024Updated 2 years ago
- PyTorch implementation of SuperPoint https://arxiv.org/abs/1712.07629☆10Mar 15, 2021Updated 5 years ago
- A PyTorch implementation of transformer for text generation.☆16Mar 17, 2019Updated 7 years ago
- Matlab code for fast Hausdorff distance for binary images or segmentation maps☆11Mar 10, 2019Updated 7 years ago
- Machine learning bootcamp https://ztlevi.gitbook.io/ml-101/☆14Jul 22, 2023Updated 2 years ago
- Triplet neural network for joint representation learning for text and images☆10Mar 17, 2019Updated 7 years ago
- ☆22Oct 3, 2023Updated 2 years ago
- Short paper for Medical Imaging with Deep Learning 2023 (#MIDL2023): https://arxiv.org/abs/2304.03941☆12Jul 17, 2023Updated 2 years ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆17Aug 26, 2023Updated 2 years ago
- AMS real-time recommendation system☆17Nov 4, 2022Updated 3 years ago
- ☆17Nov 1, 2023Updated 2 years ago
- Hyperparameter Bayesian Optimization for Image Classification in PyTorch☆11Aug 20, 2019Updated 6 years ago
- Code repo for the 11th-place solution in the RecSys Challenge 2022☆12Jul 13, 2022Updated 3 years ago
- TIP: Bi-directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification☆21Mar 29, 2021Updated 5 years ago
- Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…☆23Jul 30, 2020Updated 5 years ago
- Course resources for Introduction to Machine Learning, fall 2021, USTC☆10Jan 31, 2022Updated 4 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- Internet image-text matching based on multimodal retrieval☆15Mar 17, 2024Updated 2 years ago
- Code for the 15th-place submission in the Trading at the Close competition☆18Jun 22, 2024Updated last year
- VQA-Med 2021☆22May 13, 2026Updated last week
- 1D-CNN models for NAFLD diagnosis and liver fat fraction quantification using radiofrequency ultrasound signals☆12Jun 10, 2020Updated 5 years ago
- Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting https://arxiv.org/abs/2108.08023…☆22Sep 9, 2021Updated 4 years ago