Vision-oriented multimodal AI
☆51Jun 15, 2024Updated last year
Alternatives and similar repositories for SA-Segment-Anything
Users that are interested in SA-Segment-Anything are comparing it to the libraries listed below
Sorting:
- Segment Anything in Defect Detection☆22Jun 2, 2024Updated last year
- ☆15Sep 23, 2024Updated last year
- ☆11Oct 2, 2023Updated 2 years ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆46Jun 9, 2025Updated 8 months ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- ☆32Feb 29, 2024Updated 2 years ago
- ☆21May 26, 2025Updated 9 months ago
- Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient☆20Nov 13, 2024Updated last year
- ☆101May 16, 2024Updated last year
- ☆15Apr 28, 2023Updated 2 years ago
- The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environmen…☆16Jun 9, 2023Updated 2 years ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates…☆39Dec 5, 2023Updated 2 years ago
- FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods (ICCV 2023)☆21Feb 24, 2026Updated last week
- ☆21Oct 10, 2023Updated 2 years ago
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Aug 13, 2023Updated 2 years ago
- ☆45Nov 21, 2024Updated last year
- MCPL: MULTI-CONCEPT PROMPT LEARNING☆20May 27, 2024Updated last year
- official implement for 《LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data》☆19Dec 19, 2024Updated last year
- ☆51May 11, 2025Updated 9 months ago
- Experimental AI chat app☆23Jan 3, 2025Updated last year
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Sep 26, 2024Updated last year
- ☆19Dec 6, 2023Updated 2 years ago
- ☆29Dec 19, 2023Updated 2 years ago
- This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal …☆25Jul 23, 2023Updated 2 years ago
- Dynamic Frame Interpolation in Wavelet Domain (TIP 2023)☆20Sep 23, 2023Updated 2 years ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆22Sep 26, 2024Updated last year
- ☆25Dec 22, 2023Updated 2 years ago
- ☆134Dec 22, 2023Updated 2 years ago
- Pytorch implementation for Egoinstructor at CVPR 2024☆28Dec 1, 2024Updated last year
- An Image/Text Retrieval Test Collection to Support Multimedia Content Creation☆21Oct 21, 2023Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆279Apr 17, 2024Updated last year
- ☆58Apr 24, 2024Updated last year
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"☆55Oct 20, 2022Updated 3 years ago
- simple and efficient baselines for practical semantic segmentation with plain ViTs☆20Mar 9, 2024Updated last year
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network☆64Apr 8, 2025Updated 10 months ago
- ☆220Jul 5, 2024Updated last year
- This is the official PyTorch implementation of the paper “Neural Transformation Fields for Arbitrary-Styled Font Generation”.☆25Jun 10, 2024Updated last year
- ☆25Oct 5, 2023Updated 2 years ago