对llava官方代码的一些学习笔记
☆29Oct 11, 2024Updated last year
Alternatives and similar repositories for llava-handbook
Users that are interested in llava-handbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆209Jul 17, 2025Updated 11 months ago
- ☆10May 8, 2024Updated 2 years ago
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- ☆37Jul 21, 2025Updated 10 months ago
- Get CLIP ViT text tokens about an image, visualize attention as a heatmap.☆15Aug 8, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆23Sep 5, 2025Updated 9 months ago
- ☆71Oct 7, 2024Updated last year
- Large Language Models Can Be Contextual Privacy Protection Learners☆16Oct 28, 2024Updated last year
- ☆23May 28, 2025Updated last year
- The code for ACM MM2024 (Multimodal Unlearnable Examples: Protecting Data against Multimodal Contrastive Learning)☆14Jul 18, 2024Updated last year
- ☆14Jan 15, 2023Updated 3 years ago
- ☆20May 14, 2024Updated 2 years ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 4 months ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch☆14Feb 3, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆32Feb 6, 2026Updated 4 months ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆12May 19, 2026Updated last month
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Jan 26, 2024Updated 2 years ago
- Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023)☆13Apr 10, 2023Updated 3 years ago
- Official Implementation of the ACL2024 Findings paper "Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attr…☆18May 18, 2024Updated 2 years ago
- The codes of our paper "EasyInv: Toward Fast and Better DDIM Inversion"☆14Jun 1, 2025Updated last year
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆19Feb 4, 2026Updated 4 months ago
- ☆20May 28, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Mar 10, 2026Updated 3 months ago
- Plamber is the multiplatform web-oriented system for reading and storing books online.☆10Dec 8, 2022Updated 3 years ago
- [NeurIPS 2025 D&B] BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model☆27Aug 1, 2025Updated 10 months ago
- ☆14May 21, 2025Updated last year
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- An unofficial pytorch implementation of the BiHDM model proposed by Yang et al. for decoding emotion from multi-channel EEG recordings, w…☆15Apr 6, 2023Updated 3 years ago
- Detection of LLM-Generated Codes [ICSE2025]☆35Jul 5, 2025Updated 11 months ago
- [TOSEM'26] Awesome-LLM4SVD from "A Systematic Literature Review on Detecting Software Vulnerabilities with Large Language Models"☆75May 21, 2026Updated 3 weeks ago
- Convert .vox to .obj☆14Nov 24, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of deep-multi-trajectory-based single object tracking (IEEE T-CSVT 2021).☆12Aug 24, 2022Updated 3 years ago
- ☆12Mar 27, 2025Updated last year
- The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical found…☆25Feb 19, 2026Updated 4 months ago
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆45Jul 1, 2024Updated last year
- [SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vect…☆18Oct 5, 2024Updated last year
- ☆21Aug 6, 2025Updated 10 months ago
- Pure-PyTorch Parakeet TDT inference☆47Mar 10, 2026Updated 3 months ago