htqin / GoogleBard-VisUnderstandView external linksLinks
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges
☆30Sep 24, 2023Updated 2 years ago
Alternatives and similar repositories for GoogleBard-VisUnderstand
Users that are interested in GoogleBard-VisUnderstand are comparing it to the libraries listed below
Sorting:
- Adaptive Inter-Class Similarity Distillation for Semantic Segmentation (MTAP 2025)☆29Nov 14, 2025Updated 3 months ago
- ☆29Jan 23, 2024Updated 2 years ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- ☆13May 12, 2025Updated 9 months ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- Using Gradio interface to build UI for converting text to speech☆13Jan 26, 2021Updated 5 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient☆31Dec 8, 2023Updated 2 years ago
- ☆13Aug 25, 2023Updated 2 years ago
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark☆17May 25, 2025Updated 8 months ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆18Jan 18, 2025Updated last year
- The official repo for "GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction"☆29Mar 29, 2024Updated last year
- Official repo for “Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy”☆14Nov 26, 2024Updated last year
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆34May 8, 2024Updated last year
- Visual Representation Learning Benchmark for Self-Supervised Models☆35Apr 18, 2024Updated last year
- 2D road segmentation using lidar data during training☆43Dec 21, 2023Updated 2 years ago
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆66Jul 29, 2024Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆16Jun 14, 2024Updated last year
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos☆22Jan 26, 2026Updated 3 weeks ago
- This repo contains the eval code for Hybrid-E-loss, which is written by PyTorch code.☆18Dec 29, 2022Updated 3 years ago
- ☆20Apr 23, 2024Updated last year
- Count Tokens of Code (forked from gocloc)☆44Aug 19, 2024Updated last year
- [MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descripti…☆27Aug 5, 2024Updated last year
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆21Oct 28, 2024Updated last year
- ☆47Jan 18, 2024Updated 2 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- Official implementation of Inconsistency Masks. A robust semi-supervised segmentation framework that reframes model disagreement as a…☆19Jan 23, 2026Updated 3 weeks ago
- PyTorch implementation of GAFlow: Incorporating Gaussian Attention into Optical Flow (ICCV-2023)☆42Oct 11, 2023Updated 2 years ago
- ☆21Nov 9, 2025Updated 3 months ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25May 16, 2024Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- MERA tensor network for tiny object image classification☆16Mar 31, 2022Updated 3 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Oct 20, 2023Updated 2 years ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Aug 27, 2025Updated 5 months ago
- This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)☆92Mar 19, 2024Updated last year
- ☆88Jan 10, 2024Updated 2 years ago