nngocson2002 / ViVQA
The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)
☆21Updated last year
Alternatives and similar repositories for ViVQA
Users who are interested in ViVQA are comparing it to the repositories listed below
- ☆30Updated 2 years ago
- ☆42Updated last year
- AIVIETNAM-Hub / Hybrid-Unified-and-Iterative-A-Novel-Framework-for-Text-based-Person-Anomaly-Retrieval☆13Updated 6 months ago
- ☆32Updated 2 years ago
- BED-AIO team code for AIChallenge2023☆43Updated last year
- ☆12Updated 2 years ago
- ☆27Updated last year
- This project serves as a hands-on introduction to essential MLOps tools☆54Updated 10 months ago
- ☆33Updated 6 months ago
- ☆30Updated last year
- This repository contains the official source code for SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank T…☆28Updated 2 months ago
- [NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment☆40Updated last month
- ☆25Updated last year
- PyTorch implementation of image captioning using a transformer-based model.☆68Updated 2 years ago
- [LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Pr…☆373Updated last month
- ☆13Updated last year
- AICITY2024 Track 2 - Code from AIO_ISC Team☆37Updated last year
- A web app for both Text-based and Visual Question Answering.☆13Updated 2 years ago
- This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi…☆38Updated 8 months ago
- MLOps platform powered by Kubeflow☆26Updated 4 months ago
- A simple YOLOv10 Streamlit web demo☆24Updated last year
- Image classification implemented in PyTorch☆40Updated 2 years ago
- Web application development☆11Updated 4 years ago
- Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring☆71Updated 6 months ago
- [NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.☆36Updated 9 months ago
- ☆69Updated last year
- DSLab Training☆24Updated last year
- Large Language Models (LLMs) Learning Resources☆19Updated last year
- ☆16Updated 3 years ago
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding…☆51Updated 11 months ago