How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges
☆30 · Sep 24, 2023 · Updated 2 years ago
Alternatives and similar repositories for GoogleBard-VisUnderstand
Users that are interested in GoogleBard-VisUnderstand are comparing it to the libraries listed below.
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos ☆23 · Jan 26, 2026 · Updated 2 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" ☆21 · Oct 28, 2024 · Updated last year
- [MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descripti… ☆27 · Aug 5, 2024 · Updated last year
- ☆20 · Dec 29, 2020 · Updated 5 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23) ☆37 · Jan 1, 2024 · Updated 2 years ago
- Salient Objects in Clutter, arXiv, 2021 (ECCV 2018 extension). ☆11 · Jun 17, 2021 · Updated 4 years ago
- Project for SNARE benchmark ☆11 · Jun 5, 2024 · Updated last year
- The official repo for "GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction" ☆29 · Mar 29, 2024 · Updated 2 years ago
- This is a Python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires Python ≥ 3.5 ☆13 · Mar 17, 2026 · Updated last week
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation ☆66 · Jul 29, 2024 · Updated last year
- ☆14 · Jun 25, 2022 · Updated 3 years ago
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- MIMIC: Masked Image Modeling with Image Correspondences ☆17 · Jun 14, 2024 · Updated last year
- Panoramic audiovisual salient object segmentation ☆30 · Jul 9, 2023 · Updated 2 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning ☆14 · Dec 30, 2024 · Updated last year
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings ☆11 · Feb 24, 2025 · Updated last year
- Official repository for "Stylized Adversarial Training" (TPAMI 2022) ☆11 · Dec 30, 2022 · Updated 3 years ago
- Using Gradio interface to build UI for converting text to speech ☆13 · Jan 26, 2021 · Updated 5 years ago
- ☆20 · Apr 23, 2024 · Updated last year
- ☆14 · Mar 20, 2026 · Updated last week
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee… ☆27 · Nov 11, 2023 · Updated 2 years ago
- Count Tokens of Code (forked from gocloc) ☆45 · Aug 19, 2024 · Updated last year
- Official repository for "On Improving Adversarial Transferability of Vision Transformers" (ICLR 2022 Spotlight) ☆72 · Nov 19, 2022 · Updated 3 years ago
- ☆40 · Jul 18, 2022 · Updated 3 years ago
- ☆31 · Dec 18, 2025 · Updated 3 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆88 · Oct 20, 2023 · Updated 2 years ago
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"… ☆52 · Nov 14, 2023 · Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate providers ☆16 · Jun 2, 2024 · Updated last year
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient ☆32 · Dec 8, 2023 · Updated 2 years ago
- Local LLM-based social network filter ☆71 · Jan 31, 2024 · Updated 2 years ago
- ☆88 · Jan 10, 2024 · Updated 2 years ago
- A new multi-task learning framework using Vision Transformers ☆11 · Jun 19, 2024 · Updated last year
- Submission to the inverse scaling prize ☆23 · Jul 23, 2023 · Updated 2 years ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon… ☆18 · Jan 18, 2025 · Updated last year
- Public repository for the ECCV 2024 paper "Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation". ☆26 · Aug 5, 2025 · Updated 7 months ago
- ☆11 · Oct 29, 2024 · Updated last year
- ☆53 · May 7, 2024 · Updated last year
- Two-Step Quantization on AlexNet ☆13 · Jun 29, 2018 · Updated 7 years ago