☆14Dec 11, 2024Updated last year
Alternatives and similar repositories for VistaLLaMA
Users that are interested in VistaLLaMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Nov 28, 2024Updated last year
- ☆12Nov 13, 2024Updated last year
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Aug 19, 2023Updated 2 years ago
- Code for "Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space"☆21Mar 25, 2026Updated 3 weeks ago
- [Tensorflow] A Game Theoretic approach using GAN for Phishing URL synthesis and detection☆11Nov 14, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 7 months ago
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆27May 21, 2025Updated 10 months ago
- Unified Object Tracking Framework☆51Jun 20, 2022Updated 3 years ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆25Mar 10, 2026Updated last month
- [CVPR 2024] GeoAuxNet: Torwards Universal 3D Representation Learning for Multi-sensor Point Clouds☆18Mar 29, 2024Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago
- ☆13Oct 25, 2024Updated last year
- Phishing websites are fraudulent sites that impersonate a trusted party to gain access to sensitive information of an individual person o…☆14May 1, 2020Updated 5 years ago
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The official repo of our work "Pensieve: Retrospect-then-Compare mitigates Visual Hallucination"☆15May 4, 2024Updated last year
- The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"☆17Mar 24, 2025Updated last year
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 10 months ago
- ☆10Aug 23, 2022Updated 3 years ago
- ☆11Jan 14, 2021Updated 5 years ago
- [CVPR 2022] Visual Abductive Reasoning☆124Oct 22, 2024Updated last year
- [CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models☆20Apr 30, 2025Updated 11 months ago
- Towards Training-free Open-world Segmentation via Image Prompt Foundation Models,☆18Nov 22, 2024Updated last year
- [CVPR2022] Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis☆100Jun 23, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“☆37Aug 21, 2023Updated 2 years ago
- Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).☆51Feb 8, 2026Updated 2 months ago
- Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'☆45Jan 19, 2026Updated 2 months ago
- ☆18Jun 12, 2025Updated 10 months ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- ☆38Jan 10, 2026Updated 3 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆33Feb 28, 2026Updated last month
- (AAAI25) This is the official code repository for "MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios".☆16May 30, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Keras reimplementation of the 2015 ICCV paper "Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutio…☆13Feb 19, 2020Updated 6 years ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆90Sep 23, 2025Updated 6 months ago
- (AAAI 2025) Official PyTorch implementation of paper "SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection".☆27Nov 2, 2025Updated 5 months ago
- Market-1501 dataset with super-resolution quality☆21May 12, 2022Updated 3 years ago
- Progetto per la prova finale di Ingegneria del Software 2023-2024 al Politecnico di Milano☆10Oct 19, 2024Updated last year
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆55Nov 25, 2024Updated last year
- [CVPR2025] VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding☆24Mar 24, 2025Updated last year