The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)
☆21Jul 29, 2024Updated last year
Alternatives and similar repositories for ViVQA
Users that are interested in ViVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Mar 9, 2023Updated 3 years ago
- ☆11Feb 3, 2025Updated last year
- ☆13Dec 6, 2024Updated last year
- On-premises ELT Pipeline☆32Jul 10, 2025Updated 10 months ago
- Improved-YOLOv8☆25Nov 24, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Oct 29, 2024Updated last year
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆79Feb 13, 2026Updated 3 months ago
- ☆44Aug 7, 2024Updated last year
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆52Jun 7, 2025Updated 11 months ago
- AIO Research Agent - an all-in-one intelligent companion for navigating the academic world.☆39Jun 26, 2024Updated last year
- Building Inspection Toolkit☆32Sep 18, 2023Updated 2 years ago
- ☆31Nov 4, 2024Updated last year
- Materials for the Ultimate Hybrid Search Workshop☆46Dec 13, 2024Updated last year
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆60Apr 8, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the official repository for Retrieval Augmented Visual Question Answering☆250Dec 19, 2024Updated last year
- Lightweight Edge-Real-Time Small Object Detection on Aerial Imagery☆80May 14, 2026Updated 2 weeks ago
- ☆31Mar 10, 2023Updated 3 years ago
- This Repo, Builds an NLP system that analyzes a TV series with NLP and even creates a character chat bot with LLMs☆45Aug 24, 2024Updated last year
- A strong DETR-based detector named Domain Adaptive detection TRansformer (DATR) for unsupervised domain adaptation in object detection.☆51Feb 8, 2025Updated last year
- [ICASSP 2023] Prototype Knowledge Distillation for Medical Segmentation with Missing Modality☆58Feb 21, 2026Updated 3 months ago
- Ultralytics YOLO with Additional Knowledge Distillation Capability☆92Jan 20, 2025Updated last year
- Image Captioning using CNN and Transformer.☆55Nov 9, 2021Updated 4 years ago
- Transformer & CNN Image Captioning model in PyTorch.☆44Mar 7, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation☆147Aug 22, 2025Updated 9 months ago
- natual language guided image captioning☆88Feb 11, 2024Updated 2 years ago
- A list of awesome remote sensing image captioning resources☆123May 19, 2026Updated last week
- A package to compute medical segmentation metrics.☆186Jul 16, 2024Updated last year
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention☆168Jan 6, 2019Updated 7 years ago
- Wanna know what your model sees? Here's a package for applying EigenCAM (like GradCAM) and generating heatmap from the new YOLO models☆302Feb 14, 2026Updated 3 months ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆199May 9, 2023Updated 3 years ago
- Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping.☆916Jan 6, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A collection of Vietnamese Natural Language Processing resources.☆313Oct 28, 2025Updated 7 months ago
- Deep Modular Co-Attention Networks for Visual Question Answering☆459Dec 16, 2020Updated 5 years ago
- Python library for YOLO small object detection and instance segmentation☆549Apr 22, 2026Updated last month
- Transformer-based image captioning extension for pytorch/fairseq☆318Dec 18, 2020Updated 5 years ago
- A PyTorch reimplementation of bottom-up-attention models☆301Apr 7, 2022Updated 4 years ago
- A Vietnamese natural language processing toolkit (NAACL 2018)☆664Feb 12, 2023Updated 3 years ago
- [MICCAI 2019 Young Scientist Award] [MEDIA 2020 Best Paper Award] Models Genesis, one of the first "foundation" models in medical image a…☆781Jun 22, 2025Updated 11 months ago