pytorch-labs / segment-anything-fast
A batched offline inference oriented version of segment-anything
☆1,196Updated last month
Related projects ⓘ
Alternatives and complementary repositories for segment-anything-fast
- ☆432Updated 6 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,150Updated 5 months ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,367Updated 4 months ago
- A collection of real world AI/ML exploits for responsibly disclosed vulnerabilities☆1,427Updated 2 weeks ago
- Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"☆398Updated last month
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL☆1,381Updated this week
- The way we interact with our data is changing.☆776Updated 3 months ago
- [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale☆1,075Updated 2 weeks ago
- ☆1,763Updated 4 months ago
- Encodes a file into a video format to store on a cloud video hosting service☆903Updated 11 months ago
- Make it real☆1,450Updated 4 months ago
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,214Updated last month
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy☆2,250Updated 2 weeks ago
- [CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation☆965Updated 2 months ago
- [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scala…☆4,223Updated last month
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…☆1,245Updated 2 weeks ago
- [ECCV 2024] Tokenize Anything via Prompting☆521Updated 4 months ago
- [3DV 2025] Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, …☆888Updated 4 months ago
- Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"☆865Updated 4 months ago
- ☆4,625Updated last month
- Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scena…☆780Updated last year
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,343Updated 3 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆2,664Updated 3 months ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2☆1,045Updated last week
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,224Updated 10 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆2,064Updated 6 months ago
- Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆1,519Updated 3 months ago
- Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"☆936Updated 2 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,689Updated last month
- Emu Series: Generative Multimodal Models from BAAI☆1,659Updated last month