A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
☆99Dec 17, 2024Updated last year
Alternatives and similar repositories for Mini-LLaVA
Users that are interested in Mini-LLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pruned CoTracker architecture for tracking the myocardium in 2D echo images.☆20May 6, 2025Updated last year
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago
- Visualize any repo or codebase into diagram or animation☆24Oct 14, 2024Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆27Jun 4, 2025Updated last year
- ☆13Apr 7, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- GRadient-INformed MoE☆264Sep 25, 2024Updated last year
- A tiny, didactical implementation of LLAMA 3☆42Dec 2, 2024Updated last year
- ☆11Nov 5, 2024Updated last year
- Directed masked autoencoders☆15Mar 25, 2026Updated 3 months ago
- [ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution☆329Jul 4, 2025Updated 11 months ago
- ☆13May 10, 2025Updated last year
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆53Apr 23, 2026Updated 2 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Run SOTA Vision-Language Model Florence-2 on your data!☆15Mar 27, 2025Updated last year
- This repo contains the dataset for paper: Creating a Dataset Supporting Translation Between OpenMP Fortran and C++ Code☆15Dec 1, 2023Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆24Sep 14, 2022Updated 3 years ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 11 months ago
- Code-Switched translations with Large Language models☆25Dec 17, 2024Updated last year
- Accompanying code for "Analyzing Vision Tranformers in Class Embedding Space" (NeurIPS '23)☆16Jun 10, 2024Updated 2 years ago
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆211Jan 6, 2025Updated last year
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆123Mar 4, 2025Updated last year
- Better WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆23Oct 29, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆35Mar 19, 2024Updated 2 years ago
- ☆12Apr 27, 2013Updated 13 years ago
- Supercharge your Gaianet node by generating a vector knowledge base from any API. Demo slides: https://hackmd.io/@santteegt/ByoykY4nC#/ L…☆11Nov 29, 2024Updated last year
- ☆12Jan 17, 2024Updated 2 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- Official implementation for LaCo (EMNLP 2024 Findings)☆22Oct 3, 2024Updated last year
- Unofficial Implementation of Evolutionary Model Merging☆42Mar 28, 2024Updated 2 years ago
- LocalPlexity is a lite version of Perplexity aimed at 100% privacy and openness. Everything is done locally, in your browser, from search…☆20Aug 12, 2024Updated last year
- ☆50Jun 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators☆10Sep 7, 2015Updated 10 years ago
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts☆13Jan 30, 2024Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Prune transformer layers☆74May 30, 2024Updated 2 years ago
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆26Aug 16, 2025Updated 10 months ago