☆48May 24, 2023Updated 2 years ago
Alternatives and similar repositories for LLaVA
Users that are interested in LLaVA are comparing it to the libraries listed below
Sorting:
- Object Detection in images using Selective Search and EdgeBoxes algorithm☆33Oct 4, 2019Updated 6 years ago
- Creating the DeepSeek V3 model from scratch☆25Mar 28, 2025Updated 11 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- Camera Intrinsic Calibration and Hand-Eye Calibration in Pybullet☆22Oct 28, 2021Updated 4 years ago
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning☆130Oct 6, 2025Updated 5 months ago
- Fast stand-alone C++ decoder for RNN-based NMT models☆31Dec 12, 2020Updated 5 years ago
- Video Reasoning Segmentation☆28Nov 29, 2024Updated last year
- LLM Inference with Microscaling Format☆34Nov 12, 2024Updated last year
- ☆35Nov 25, 2025Updated 3 months ago
- code for triplet GAN☆31Apr 9, 2018Updated 7 years ago
- [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆97Apr 14, 2025Updated 10 months ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆37Dec 18, 2021Updated 4 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- ☆10Nov 15, 2015Updated 10 years ago
- ☆55Feb 24, 2026Updated 2 weeks ago
- lshash for python3☆10Mar 21, 2018Updated 7 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 8 years ago
- ☆12Dec 14, 2022Updated 3 years ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated last month
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- Header-only configuration file library for C++11☆12Nov 13, 2014Updated 11 years ago
- ☆11May 24, 2024Updated last year
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference☆57Nov 20, 2024Updated last year
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 3 weeks ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆12Jun 1, 2024Updated last year
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 8 months ago
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 4 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- ☆21Updated this week
- ☆12Aug 25, 2017Updated 8 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- ☆11Apr 3, 2023Updated 2 years ago
- ☆11Aug 4, 2022Updated 3 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago