camenduru / LLaVA-colab
☆219Updated last year
Alternatives and similar repositories for LLaVA-colab:
Users that are interested in LLaVA-colab are comparing it to the libraries listed below
- ☆708Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆726Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆123Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆19Updated last year
- Example code for extracting Q&A datasets from LLM's☆79Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆161Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆181Updated 11 months ago
- LLaVA-Interactive-Demo☆365Updated 7 months ago
- ☆54Updated last year
- Docker image for LLaVA: Large Language and Vision Assistant☆1Updated 8 months ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆37Updated last year
- Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"☆265Updated 9 months ago
- ☆82Updated last year
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated 8 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆165Updated last year
- Haystack and Mistral 7B RAG Implementation. It is based on completely open-source stack.☆79Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆200Updated 10 months ago
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆69Updated last year
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆93Updated 9 months ago
- Quick exploration into fine tuning florence 2☆303Updated 5 months ago
- ☆189Updated last year
- Local LLM ReAct Agent with Guidance☆157Updated last year
- One click templates for inferencing Language Models☆162Updated this week
- Awesome LLM application repo☆67Updated this week
- ☆77Updated last year
- A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large …☆123Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆384Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆62Updated 6 months ago