Official Repository of Personalized Visual Instruct Tuning
☆34Mar 6, 2025Updated 11 months ago
Alternatives and similar repositories for PVIT
Users that are interested in PVIT are comparing it to the libraries listed below
Sorting:
- ☆13Jun 4, 2025Updated 9 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆60Aug 23, 2024Updated last year
- Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection☆59Updated this week
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis☆34Dec 27, 2023Updated 2 years ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆44Apr 21, 2024Updated last year
- ☆21Oct 31, 2024Updated last year
- [ICLR 2026] Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos☆27Jan 26, 2026Updated last month
- 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)☆118Mar 26, 2025Updated 11 months ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- ☆32Nov 18, 2025Updated 3 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 9 months ago
- CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs (CVPR2024)☆17Jun 14, 2024Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆186Jul 5, 2024Updated last year
- ☆21Jul 25, 2025Updated 7 months ago
- A flexible & scalable MLLM-based AIGC detection pipeline☆28Oct 27, 2025Updated 4 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Mar 25, 2024Updated last year
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models☆20May 24, 2025Updated 9 months ago
- ☆17Jan 9, 2025Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- Official implement of MIA-DPO☆70Jan 23, 2025Updated last year
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆66Oct 27, 2024Updated last year
- ☆16Jul 23, 2024Updated last year
- A curated list of Awesome Personalized Large Multimodal Models resources☆55Feb 4, 2026Updated last month
- Official repo for StyleMe3D☆28Apr 22, 2025Updated 10 months ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆193Sep 7, 2025Updated 5 months ago
- ☆50Jan 6, 2025Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Dec 27, 2024Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- Superpixel-guided Sampling for Compact 3d Gaussian Splatting☆22Nov 12, 2024Updated last year
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"☆52Oct 24, 2024Updated last year
- Vocabulary Parallelism☆25Mar 10, 2025Updated 11 months ago
- Evaluating the faithfulness of long-context language models☆30Oct 21, 2024Updated last year
- Official implementation of MC-LLaVA.☆140Nov 10, 2025Updated 3 months ago
- ☆20May 26, 2020Updated 5 years ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆56Dec 3, 2025Updated 3 months ago
- ☆27Apr 11, 2023Updated 2 years ago