Vision-oriented multimodal AI
☆52Jun 15, 2024Updated last year
Alternatives and similar repositories for SA-Segment-Anything
Users that are interested in SA-Segment-Anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Apr 7, 2026Updated 3 weeks ago
- ☆32Feb 29, 2024Updated 2 years ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆46Jun 9, 2025Updated 10 months ago
- Pytorch implementation for Egoinstructor at CVPR 2024☆28Dec 1, 2024Updated last year
- ☆21Oct 10, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- MIMIC: Masked Image Modeling with Image Correspondences☆16Jun 14, 2024Updated last year
- ☆25Dec 22, 2023Updated 2 years ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates…☆38Dec 5, 2023Updated 2 years ago
- official implement for 《LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data》☆19Dec 19, 2024Updated last year
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆279Apr 17, 2024Updated 2 years ago
- Efficient Point-based 3D Semantic Occupancy Prediction☆173Jul 13, 2024Updated last year
- ☆11Oct 2, 2023Updated 2 years ago
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network☆65Apr 8, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PyTorch Implementation for InMaP☆12Oct 28, 2023Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Apr 15, 2026Updated 2 weeks ago
- ☆101May 16, 2024Updated last year
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated 2 months ago
- ☆19Dec 6, 2023Updated 2 years ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆39Mar 4, 2024Updated 2 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆22Sep 26, 2024Updated last year
- ☆15Jun 17, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods (ICCV 2023)☆21Feb 24, 2026Updated 2 months ago
- ☆52May 11, 2025Updated 11 months ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"☆54Oct 20, 2022Updated 3 years ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated last year
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated last year
- ☆134Dec 22, 2023Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆76Sep 26, 2024Updated last year
- ☆55Jun 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆72Jul 10, 2024Updated last year
- Dynamic Frame Interpolation in Wavelet Domain (TIP 2023)☆20Sep 23, 2023Updated 2 years ago
- ☆10Apr 8, 2024Updated 2 years ago
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”☆39Jun 12, 2023Updated 2 years ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environmen…☆16Apr 3, 2026Updated 3 weeks ago
- ☆15Apr 28, 2023Updated 3 years ago