pytorch-labs / segment-anything-fast
A batched offline inference oriented version of segment-anything
☆1,204Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for segment-anything-fast
- ☆434Updated 7 months ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,383Updated 5 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,160Updated 5 months ago
- ☆1,766Updated 4 months ago
- Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"☆402Updated last month
- The way we interact with our data is changing.☆783Updated 3 months ago
- A collection of real world AI/ML exploits for responsibly disclosed vulnerabilities☆1,438Updated 3 weeks ago
- [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale☆1,083Updated last month
- Home of the Flutter Casual Games Toolkit and other Flutter gaming templates☆609Updated 2 weeks ago
- Make it real☆1,453Updated 4 months ago
- [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scala…☆4,277Updated last month
- Efficient vision foundation models for high-resolution generation and perception.☆2,354Updated last week
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,218Updated last month
- [CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation☆968Updated 3 months ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,289Updated last year
- Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆1,521Updated 3 months ago
- ☆4,641Updated last month
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy☆2,262Updated last month
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,362Updated 4 months ago
- Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"☆872Updated 4 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,003Updated last month
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL☆1,390Updated this week
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆2,971Updated 3 weeks ago
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,228Updated 10 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,251Updated 7 months ago
- Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.☆1,766Updated 8 months ago
- [3DV 2025] Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, …☆893Updated 5 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,693Updated last month
- Segment Anything Labelling Tool☆1,024Updated 9 months ago
- A realtime sketch to image demo using LCM and the gradio library.☆1,798Updated 11 months ago