How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks, ICLR 2026
☆72Mar 6, 2026Updated 3 months ago
Alternatives and similar repositories for fm-vision-evals
Users that are interested in fm-vision-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- ☆47Jun 24, 2025Updated 11 months ago
- ☆15Oct 12, 2024Updated last year
- Training recipe for SpatialReasoner [NeurIPS 2025]☆45Apr 5, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆41Nov 11, 2025Updated 7 months ago
- ICLR 2026: Agent-X Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆42Apr 28, 2026Updated last month
- PointNu-Net Project☆19Dec 28, 2023Updated 2 years ago
- The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”☆64Mar 20, 2026Updated 2 months ago
- Official implementation of "Opt-In Art: Learning Art Styles Only from Few Examples" (Accepted by NeurIPS 2025)☆33Nov 30, 2025Updated 6 months ago
- ☆17Aug 5, 2025Updated 10 months ago
- ☆22Sep 16, 2025Updated 9 months ago
- ☆25Jul 10, 2023Updated 2 years ago
- Code accompanying PDiscoNet: Semantically consistent part discovery for fine-grained recognition☆15Dec 10, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- documentation used in my projects