Vision-oriented multimodal AI
☆52Jun 15, 2024Updated last year
Alternatives and similar repositories for SA-Segment-Anything
Users that are interested in SA-Segment-Anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Feb 29, 2024Updated 2 years ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆46Jun 9, 2025Updated 10 months ago
- Pytorch implementation for Egoinstructor at CVPR 2024☆28Dec 1, 2024Updated last year
- ☆21Oct 10, 2023Updated 2 years ago
- MIMIC: Masked Image Modeling with Image Correspondences☆17Jun 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆25Dec 22, 2023Updated 2 years ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates…☆38Dec 5, 2023Updated 2 years ago
- official implement for 《LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data》☆19Dec 19, 2024Updated last year
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆279Apr 17, 2024Updated last year
- Efficient Point-based 3D Semantic Occupancy Prediction☆173Jul 13, 2024Updated last year
- ☆11Oct 2, 2023Updated 2 years ago
- PyTorch Implementation for InMaP☆11Oct 28, 2023Updated 2 years ago
- ☆101May 16, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- MCPL: MULTI-CONCEPT PROMPT LEARNING☆20May 27, 2024Updated last year
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Aug 13, 2023Updated 2 years ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated last month
- ☆19Dec 6, 2023Updated 2 years ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆39Mar 4, 2024Updated 2 years ago
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)☆68May 9, 2025Updated 11 months ago
- ☆38Feb 16, 2025Updated last year
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆22Sep 26, 2024Updated last year
- ☆15Jun 17, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation (Published in WACV 2024 W…☆24Mar 10, 2026Updated last month
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆14Mar 11, 2025Updated last year
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated last year
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated last year
- ☆29Dec 19, 2023Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆76Sep 26, 2024Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- ☆55Jun 4, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆72Jul 10, 2024Updated last year
- Anatomy-aware self-supervised learning☆11Jun 22, 2024Updated last year
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC…☆12May 29, 2021Updated 4 years ago
- The official repo of continuous speculative decoding☆32Mar 28, 2025Updated last year
- A hobby project that dewarps book pages in images☆19Jan 5, 2023Updated 3 years ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- ☆11Jul 3, 2022Updated 3 years ago