camenduru / LLaVA-colab
☆219Updated last year
Alternatives and similar repositories for LLaVA-colab:
Users that are interested in LLaVA-colab are comparing it to the libraries listed below
- ☆707Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆726Updated last year
- Example code for extracting Q&A datasets from LLM's☆79Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆19Updated last year
- Local LLM ReAct Agent with Guidance☆157Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- From scratch implementation of a vision language model in pure PyTorch☆199Updated 10 months ago
- llama.cpp with BakLLaVA model describes what does it see☆384Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆224Updated 10 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆161Updated last year
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated 8 months ago
- LLaVA-Interactive-Demo☆365Updated 7 months ago
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆93Updated 8 months ago
- Quick exploration into fine tuning florence 2☆303Updated 5 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆123Updated last year
- Automatically evaluate your LLMs in Google Colab☆600Updated 10 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆270Updated 7 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆179Updated 11 months ago
- ☆168Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆164Updated last year
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backed☆128Updated 10 months ago
- ☆82Updated last year
- A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large …☆123Updated last year
- Haystack and Mistral 7B RAG Implementation. It is based on completely open-source stack.☆79Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- ☆188Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- ☆77Updated last year
- GPT-4 Vision Chatbot examples☆60Updated last year
- Langchain implementation of HuggingGPT☆126Updated last year