This project is an Android application that helps visually impaired users: they take a picture and ask questions about it, and the application answers using machine learning techniques and tools.
☆8May 28, 2022Updated 3 years ago
Alternatives and similar repositories for Visual-Question-Answering
Users that are interested in Visual-Question-Answering are comparing it to the libraries listed below.
- An MCP server for Google Scholar written in TypeScript with Streamable HTTP☆21Aug 18, 2025Updated 8 months ago
- ☆12Nov 3, 2024Updated last year
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆11Jun 14, 2024Updated last year
- Craft and run Agents right from your phone☆31Oct 14, 2025Updated 6 months ago
- Extends Spotlight by running JavaScript.☆48Dec 23, 2025Updated 4 months ago
- A lightweight deep learning model with a web application to answer image-based questions with a non-generative approach for the Viz…☆14Jun 27, 2023Updated 2 years ago
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆26Jan 17, 2026Updated 3 months ago
- ☆13Sep 8, 2024Updated last year
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆19May 27, 2025Updated 11 months ago
- ☆31May 1, 2024Updated 2 years ago
- ☆13Apr 10, 2025Updated last year
- ☆13Jul 1, 2024Updated last year
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Jun 18, 2025Updated 10 months ago
- ☆21Jun 4, 2025Updated 11 months ago
- ☆19Dec 13, 2023Updated 2 years ago
- An ultra-minimalistic, lightweight, native macOS PDF viewer☆49Apr 24, 2026Updated last week
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆26Nov 17, 2025Updated 5 months ago
- G^3: Geolocation via Guidebook Grounding, Findings of EMNLP 2022☆17Sep 10, 2024Updated last year
- Dataset of people walking on roads☆15Mar 2, 2024Updated 2 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated last year
- under review☆14Mar 1, 2021Updated 5 years ago
- [ICML 24] A novel automated neuron explanation framework that can accurately describe poly-semantic concepts in deep neural networks☆14May 2, 2025Updated last year
- [CVPR 2022] HINT: Hierarchical Neuron Concept Explainer☆20Apr 19, 2023Updated 3 years ago
- A Chrome Extension that allows developers to inspect, monitor, and execute tools exposed via the experimental `navigator.modelContextTest…☆73Apr 1, 2026Updated last month
- theme_title_use_abbreviated_path equivalent for zsh on osx☆12Feb 28, 2021Updated 5 years ago
- This is the official source code for CVPR 2024 paper [WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by I…☆16Mar 26, 2024Updated 2 years ago
- Standard button types for SwiftUI.☆37Jan 18, 2026Updated 3 months ago
- Up-to-date Vision Language Models collection. Mainly focus on computer vision☆19Feb 9, 2023Updated 3 years ago
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs)☆65Jul 24, 2025Updated 9 months ago
- A native iOS double slider for Titanium Mobile.☆26Feb 22, 2016Updated 10 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 3 years ago
- ☆34Feb 27, 2025Updated last year
- ☆13May 3, 2023Updated 3 years ago
- MCP server for OpenAlex☆41Aug 11, 2025Updated 8 months ago
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆29Jan 26, 2025Updated last year
- ☆31Dec 29, 2025Updated 4 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated 2 months ago