PKU-YuanGroup / LLaVA-o1View external linksLinks
☆56Nov 21, 2024Updated last year
Alternatives and similar repositories for LLaVA-o1
Users that are interested in LLaVA-o1 are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆2,125Dec 12, 2025Updated 2 months ago
- ☆16Jul 23, 2024Updated last year
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- ☆11Apr 21, 2025Updated 9 months ago
- ☆15Jun 6, 2024Updated last year
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation (NAACL 2024)☆17Dec 8, 2024Updated last year
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆17Jul 1, 2024Updated last year
- Some preliminary explorations of Mamba's context scaling.☆13Dec 18, 2024Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- ☆22Oct 22, 2024Updated last year
- About The official GitHub page for ''Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with …☆29Dec 12, 2024Updated last year
- ☆21Sep 5, 2023Updated 2 years ago
- Memory-Bounded GPU Acceleration for Vector Search☆33Dec 29, 2025Updated last month
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp…☆27Sep 24, 2025Updated 4 months ago
- A deep neural network based system for realtime underwater color correction onboard AUVs.☆30Mar 28, 2024Updated last year
- NeurIPS 2024 (spotlight): A Textbook Remedy for Domain Shifts Knowledge Priors for Medical Image Analysis☆30Oct 15, 2024Updated last year
- [ICML 2024 - Foundation Models in the Wild] DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection☆29Aug 2, 2024Updated last year
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆30May 18, 2025Updated 8 months ago
- Dateset Reset Policy Optimization☆31Apr 12, 2024Updated last year
- ☆32Feb 8, 2024Updated 2 years ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆126Jan 14, 2025Updated last year
- ☆35Jan 21, 2025Updated last year
- ☆11Jul 13, 2025Updated 7 months ago
- ☆35May 13, 2025Updated 9 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- Testing Language Models for Memorization of Tabular Datasets.☆36Feb 10, 2025Updated last year
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆41Jan 8, 2026Updated last month
- ☆38Feb 8, 2024Updated 2 years ago
- ☆34Sep 14, 2024Updated last year
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- Avionics software to be developed and passed down over multiple tours.☆11May 25, 2020Updated 5 years ago
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Sep 26, 2024Updated last year
- ☆41Sep 25, 2023Updated 2 years ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆156Apr 7, 2025Updated 10 months ago
- Deep Reasoning Translation (DRT) Project☆241Sep 1, 2025Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆42Jun 2, 2025Updated 8 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆101Oct 23, 2024Updated last year