Repo of HawkLlama.
☆16Jan 2, 2025Updated last year
Alternatives and similar repositories for VLModel
Users that are interested in VLModel are comparing it to the libraries listed below
- ☆17Apr 20, 2025Updated 11 months ago
- ☆22Jun 30, 2023Updated 2 years ago
- [ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding☆31Aug 23, 2024Updated last year
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆88Aug 23, 2024Updated last year
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆49Apr 14, 2025Updated 11 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆63Mar 5, 2026Updated 2 weeks ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆53Jul 6, 2025Updated 8 months ago
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆79Nov 17, 2025Updated 4 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆322Aug 10, 2024Updated last year
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 7 months ago
- ☆25Nov 30, 2023Updated 2 years ago
- ☆37Oct 21, 2022Updated 3 years ago
- ☆70Oct 19, 2023Updated 2 years ago
- [ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models☆131Aug 23, 2024Updated last year
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆85Sep 29, 2025Updated 5 months ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆149Mar 5, 2026Updated 2 weeks ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆176Sep 1, 2025Updated 6 months ago
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆27Nov 16, 2025Updated 4 months ago
- ☆14Sep 20, 2025Updated 6 months ago
- Code for "PATS: Patch Area Transportation with Subdivision for Local Feature Matching", CVPR 2023☆96Aug 28, 2023Updated 2 years ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆137May 4, 2024Updated last year
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- Unofficial implementation of "Diffusion Self-Guidance for Controllable Image Generation" (Epstein et al., 2023)☆33Jul 10, 2023Updated 2 years ago
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model☆107Mar 24, 2025Updated 11 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions☆44Mar 10, 2025Updated last year
- [T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm☆370Sep 30, 2025Updated 5 months ago
- [ICLR'24 & IJCV'25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching☆551Dec 3, 2025Updated 3 months ago
- ☆54Aug 3, 2023Updated 2 years ago
- The code for the NeurIPS19 paper and blog on "Uniform convergence may be unable to explain generalization in deep learning".☆10Oct 26, 2019Updated 6 years ago
- A Mechanistic View on Video Generation as World Models: State and Dynamics☆31Mar 9, 2026Updated last week
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Aug 6, 2018Updated 7 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception☆298Sep 21, 2025Updated 5 months ago
- ☆13Apr 23, 2025Updated 10 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated last year
- Let us try implementing SAN in pytorch from scratch☆16Jun 7, 2018Updated 7 years ago
- ☆15May 30, 2024Updated last year
- Antonino Furnari's fork of Feichtenhofer's gpu_flow, with temporal dilation.☆10Sep 18, 2020Updated 5 years ago
- ☆13Aug 14, 2022Updated 3 years ago