Repo of HawkLlama.
☆16Jan 2, 2025Updated last year
Alternatives and similar repositories for VLModel
Users that are interested in VLModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Apr 20, 2025Updated last year
- ☆22Jun 30, 2023Updated 2 years ago
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".☆31Aug 23, 2024Updated last year
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆87Aug 23, 2024Updated last year
- [ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding☆34Aug 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆64Mar 5, 2026Updated 2 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆51Apr 14, 2025Updated last year
- ☆49Oct 6, 2024Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆80Apr 30, 2026Updated 2 weeks ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆324Aug 10, 2024Updated last year
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"☆24Apr 9, 2026Updated last month
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 9 months ago
- ☆25Nov 30, 2023Updated 2 years ago
- ☆70Oct 19, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models☆131Aug 23, 2024Updated last year
- vscode extension for showing images in tile view☆11Mar 6, 2023Updated 3 years ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆149Mar 5, 2026Updated 2 months ago
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆93Sep 29, 2025Updated 7 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆177Sep 1, 2025Updated 8 months ago
- ☆14Sep 20, 2025Updated 8 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆138May 4, 2024Updated 2 years ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆13Feb 5, 2024Updated 2 years ago
- [ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".☆501Jan 9, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆30Nov 16, 2025Updated 6 months ago
- Unofficial implementation of "Diffusion Self-Guidance for Controllable Image Generation" (Epstein et al., 2023)☆33Jul 10, 2023Updated 2 years ago
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model☆107Mar 24, 2025Updated last year
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions☆43Mar 10, 2025Updated last year
- [T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm☆372Sep 30, 2025Updated 7 months ago
- [ICLR'24 & IJCV‘25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching☆556Dec 3, 2025Updated 5 months ago
- ☆54Aug 3, 2023Updated 2 years ago
- [CVPR 2026 Highlight] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆80Mar 25, 2026Updated last month
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆221Jan 24, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The code for the NeurIPS19 paper and blog on "Uniform convergence may be unable to explain generalization in deep learning".☆10Oct 26, 2019Updated 6 years ago
- A Mechanistic View on Video Generation as World Models: State and Dynamics☆39Updated this week
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆46Jun 16, 2024Updated last year
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆82Oct 15, 2023Updated 2 years ago
- ☆15May 30, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago