Vision-oriented multimodal AI
☆52 · Jun 15, 2024 · Updated last year
Alternatives and similar repositories for SA-Segment-Anything
Users that are interested in SA-Segment-Anything are comparing it to the libraries listed below.
- ☆24 · Apr 7, 2026 · Updated 2 months ago
- Segment Anything in Defect Detection ☆23 · Jun 2, 2024 · Updated 2 years ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding ☆46 · Jun 9, 2025 · Updated last year
- PyTorch implementation for Egoinstructor at CVPR 2024 ☆28 · Dec 1, 2024 · Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences ☆16 · Jun 14, 2024 · Updated 2 years ago
- ☆25 · Dec 22, 2023 · Updated 2 years ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates… ☆38 · Dec 5, 2023 · Updated 2 years ago
- Official implementation of "LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data" ☆19 · Dec 19, 2024 · Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models ☆279 · Apr 17, 2024 · Updated 2 years ago
- ☆11 · Oct 2, 2023 · Updated 2 years ago
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network ☆67 · Apr 8, 2025 · Updated last year
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate providers ☆16 · Apr 15, 2026 · Updated last month
- ☆101 · May 16, 2024 · Updated 2 years ago
- MCPL: MULTI-CONCEPT PROMPT LEARNING ☆20 · May 27, 2024 · Updated 2 years ago
- An official codebase for the paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos" (ICCV 23) ☆52 · Aug 13, 2023 · Updated 2 years ago
- ☆21 · Jul 3, 2025 · Updated 11 months ago
- ☆34 · Jan 16, 2024 · Updated 2 years ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning ☆43 · Mar 2, 2026 · Updated 3 months ago
- ☆19 · Dec 6, 2023 · Updated 2 years ago
- Simple and efficient baselines for practical semantic segmentation with plain ViTs ☆20 · Mar 9, 2024 · Updated 2 years ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training" ☆39 · Mar 4, 2024 · Updated 2 years ago
- This repo contains code and instructions for baselines in the VLUE benchmark. ☆41 · Jul 16, 2022 · Updated 3 years ago
- [LLaVA-Video-R1] ✨ First Adaptation of R1 to LLaVA-Video (2025-03-18) ☆68 · May 9, 2025 · Updated last year
- ☆38 · Feb 16, 2025 · Updated last year
- ☆52 · May 11, 2025 · Updated last year
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment" ☆54 · Oct 20, 2022 · Updated 3 years ago
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling ☆16 · Apr 15, 2026 · Updated last month
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo… ☆17 · Jun 12, 2024 · Updated 2 years ago
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters ☆16 · May 30, 2024 · Updated 2 years ago
- ☆29 · Dec 19, 2023 · Updated 2 years ago
- ☆134 · Dec 22, 2023 · Updated 2 years ago
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio… ☆76 · Sep 26, 2024 · Updated last year
- Code for the AAAI 2023 paper "Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models" ☆18 · Dec 6, 2022 · Updated 3 years ago
- ☆55 · Jun 4, 2024 · Updated 2 years ago
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ☆72 · Jul 10, 2024 · Updated last year
- Anatomy-aware self-supervised learning ☆11 · Jun 22, 2024 · Updated last year
- From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving ☆11 · Mar 16, 2025 · Updated last year
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC… ☆12 · May 29, 2021 · Updated 5 years ago