Repo of HawkLlama.
☆16Jan 2, 2025Updated last year
Alternatives and similar repositories for VLModel
Users that are interested in VLModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Apr 20, 2025Updated last year
- ☆22Jun 30, 2023Updated 3 years ago
- ☆39Mar 5, 2026Updated 3 months ago
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".☆31Aug 23, 2024Updated last year
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆66Mar 5, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆52Apr 14, 2025Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆53Jul 6, 2025Updated 11 months ago
- ☆49Oct 6, 2024Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆81Apr 30, 2026Updated last month
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆324Aug 10, 2024Updated last year
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆14Jul 26, 2025Updated 11 months ago
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"☆25Apr 9, 2026Updated 2 months ago
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆123May 20, 2026Updated last month
- TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction☆320Jun 12, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Nov 30, 2023Updated 2 years ago
- ☆36Oct 21, 2022Updated 3 years ago
- vscode extension for showing images in tile view☆11Mar 6, 2023Updated 3 years ago
- [3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting☆162Dec 9, 2025Updated 6 months ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆149Mar 5, 2026Updated 3 months ago
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆92Sep 29, 2025Updated 9 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆178Sep 1, 2025Updated 9 months ago
- Code for "PATS: Patch Area Transportation with Subdivision for Local Feature Matching", CVPR 2023☆99Aug 28, 2023Updated 2 years ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆13Feb 5, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".☆498Jan 9, 2025Updated last year
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆29Nov 16, 2025Updated 7 months ago
- Unofficial implementation of "Diffusion Self-Guidance for Controllable Image Generation" (Epstein et al., 2023)☆33Jul 10, 2023Updated 2 years ago
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model☆107Mar 24, 2025Updated last year
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions☆45Mar 10, 2025Updated last year
- [T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm☆376Sep 30, 2025Updated 9 months ago
- [CVPR 2026 Highlight] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆87Jun 18, 2026Updated last week
- A Mechanistic View on Video Generation as World Models: State and Dynamics☆47Updated this week
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Aug 6, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- ☆13Apr 23, 2025Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated 2 years ago
- Let us try implementing SAN in pytorch from scratch☆16Jun 7, 2018Updated 8 years ago
- ☆14May 30, 2024Updated 2 years ago
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆82Oct 15, 2023Updated 2 years ago
- Antonino Furnari's fork of Feichtenhofer's gpu_flow, with temporal dilation.☆10Sep 18, 2020Updated 5 years ago