camenduru / LLaVA-colab
☆222Updated last year
Alternatives and similar repositories for LLaVA-colab
Users that are interested in LLaVA-colab are comparing it to the libraries listed below
- ☆710Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆744Updated last year
- A real-time video caption to conversation bot that captures frames, generates captions, and creates conversational responses using a Large …☆122Updated last year
- Example code for extracting Q&A datasets from LLMs☆82Updated 2 years ago
- LLaVA-Interactive-Demo☆374Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆128Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- llama.cpp with the BakLLaVA model describes what it sees☆383Updated last year
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆184Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆19Updated last year
- ☆81Updated last year
- This repo contains the code covered in the YouTube tutorials.☆86Updated 3 weeks ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- Docker image for LLaVA: Large Language and Vision Assistant☆2Updated last month
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.☆37Updated last year
- ☆80Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- HPT - Open Multimodal LLMs from HyperGAI☆316Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆225Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes…☆147Updated last year
- ☆431Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆327Updated last week
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆278Updated last year
- webcamGPT - chat with video stream 💬 + 📸☆264Updated last year
- ☆193Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query.☆167Updated last year
- An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal …☆361Updated last year