A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach for the VizWiz Grand Challenge 2023. It carefully curates the answer vocabulary and adds a linear layer on top of OpenAI's CLIP model, which serves as the image and text encoder.
☆14Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for Visual-Question-Answering
Users who are interested in Visual-Question-Answering are comparing it to the libraries listed below.
- A self-evident application of the VQA task is to design systems that aid blind people with sight-reliant queries. The VizWiz VQA dataset …☆15Dec 12, 2023Updated 2 years ago
- ☆23Aug 9, 2021Updated 4 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- ☆30Mar 24, 2018Updated 8 years ago
- odgt CrowdHuman dataset annotation to YOLO txt and Pascal VOC xml☆10Dec 1, 2020Updated 5 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- My notes for machine learning☆10Apr 27, 2022Updated 3 years ago
- 2021 QQ Browser AI Algorithm Competition, Track 1: 17th place in the finals☆17Oct 25, 2022Updated 3 years ago
- ☆12Jan 5, 2023Updated 3 years ago
- ☆12Feb 25, 2026Updated last month
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 2 years ago
- The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization.☆12Dec 25, 2021Updated 4 years ago
- Top-7 solution for the iFLYTEK offline sales volume challenge☆12Aug 21, 2021Updated 4 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- A Persian Image Captioning model based on Vision Encoder Decoder Models of the transformers🤗.☆20Feb 27, 2022Updated 4 years ago
- Source code for the USTC Network Security Lab website☆11Sep 1, 2022Updated 3 years ago
- Official implementation of 'P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering'. (Accepted by ICLR 2024)☆18Jan 19, 2024Updated 2 years ago
- USTC Spring 2022 course resources for "Introduction to Deep Learning"☆10Aug 7, 2022Updated 3 years ago
- The code for the ACM Multimedia 2023 paper 'DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting'.☆13Jan 12, 2024Updated 2 years ago
- PyTorch implementation of SuperPoint https://arxiv.org/abs/1712.07629☆10Mar 15, 2021Updated 5 years ago
- Machine learning bootcamp https://ztlevi.gitbook.io/ml-101/☆14Jul 22, 2023Updated 2 years ago
- C++ deployment of YOLOv8 with TensorRT.☆16Oct 29, 2024Updated last year
- Short paper to Medical Imaging with Deep Learning 2023 (#MIDL2023) > https://arxiv.org/abs/2304.03941☆12Jul 17, 2023Updated 2 years ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆17Aug 26, 2023Updated 2 years ago
- AMS real-time recommender system☆17Nov 4, 2022Updated 3 years ago
- Triplet loss utility for the PyTorch library.☆13Jul 25, 2024Updated last year
- ☆17Nov 1, 2023Updated 2 years ago
- Recommender system☆13Oct 4, 2022Updated 3 years ago
- TIP: Bi-directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification☆22Mar 29, 2021Updated 5 years ago
- Code repo for the 11th-place solution in the RecSys Challenge 2022☆12Jul 13, 2022Updated 3 years ago
- USTC Fall 2021 course resources for "Introduction to Machine Learning"☆10Jan 31, 2022Updated 4 years ago
- Introduction to C++11 multithreading☆12Apr 24, 2019Updated 6 years ago
- Multi-modal data augmentation for machine learning☆16Jun 4, 2019Updated 6 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- Internet image-text matching based on multimodal retrieval☆15Mar 17, 2024Updated 2 years ago
- WeChat Big Data Challenge 2021☆18Sep 6, 2021Updated 4 years ago
- ☆13Dec 16, 2022Updated 3 years ago
- "Fair Federated AI" Summer School, July 19-21, 2024; 16:00 — 19:30 CET time☆13Aug 19, 2024Updated last year
- E-commerce advertising recommender system☆14Jun 3, 2022Updated 3 years ago