[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enlarged hidden dimension to build super frontier vision language models.
☆64Oct 9, 2024Updated last year
Alternatives and similar repositories for Phantom
Users that are interested in Phantom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Dec 27, 2024Updated last year
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO…☆99Jun 28, 2024Updated last year
- [CVPR 2023] Official PyTorch Implementation for "Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust N…☆46Jul 18, 2023Updated 2 years ago
- Official PyTorch Implementation Code for Developing Super Fast Adversarial Training with Distributed Data Parallel, Channel Last Memory F…☆33Mar 13, 2023Updated 3 years ago
- Modification to YOLO for improving Dynamic Real-Time Processing on Robotics Operating Systems for Autonomous Vehicle System☆21Feb 16, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [OpenReview] Official PyTorch Implementation for "Towards Adversarial Robustness of Bayesian Neural Network through Hierarchical Variatio…☆23Feb 15, 2022Updated 4 years ago
- Robustly Converting Camera View from Normal View to Top View for Autonomous Vehicle System on Robotics Operating System (ROS)☆24Jan 29, 2020Updated 6 years ago
- ☆12Dec 4, 2024Updated last year
- (Pytorch ver) Code for "Fully Neural Network based Model for General Temporal Point Process"☆21Sep 15, 2020Updated 5 years ago
- [CVPR 2022] Official PyTorch Implementation for "Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network"☆32Mar 13, 2023Updated 3 years ago
- [NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…☆117May 30, 2024Updated last year
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆80Dec 10, 2024Updated last year
- Official Implementation of Video-MA2MBA☆12Dec 3, 2024Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20May 27, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Hetsgg☆28Mar 6, 2023Updated 3 years ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆110May 27, 2025Updated 11 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- Pytorch Implmentation of Meta Attack via Contrastive Surrogate Objective☆12May 21, 2024Updated 2 years ago
- [COLM'25] Official implementation of the Law of Vision Representation in MLLMs☆177Oct 6, 2025Updated 7 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆77Jan 24, 2024Updated 2 years ago
- ☆17Dec 13, 2023Updated 2 years ago
- Implementation of LaViC (KDD 2025)☆12Jun 1, 2025Updated 11 months ago
- ☆11Jun 21, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- Accelerating the development of large multimodal models (LMMs) with lmms-eval☆14Oct 14, 2024Updated last year
- Github repo for MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning☆19Jun 12, 2024Updated last year
- SFT+RL boosts multimodal reasoning☆49Jun 27, 2025Updated 11 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆29Jul 9, 2025Updated 10 months ago
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆197Mar 17, 2025Updated last year
- ☆12Jun 12, 2024Updated last year
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)☆25Apr 20, 2025Updated last year
- Official implementation of project Honeybee (CVPR 2024)☆468May 10, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- ☆16Mar 26, 2025Updated last year
- Advanced Energy Control Management System (Advanced-ECMS) for Electrical Vehicle System using proposed Plus Version of Alternating Direct…☆29Feb 15, 2022Updated 4 years ago
- ☆35Jan 21, 2025Updated last year
- The official source code for "Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Model".☆14Jul 23, 2024Updated last year
- [NeurIPS 2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models☆23Oct 21, 2025Updated 7 months ago
- A PyTorch implementation for the paper: Fully Convolutional Scene Graph Generation, CVPR 2021☆30Aug 5, 2022Updated 3 years ago