autodistill / autodistill-gpt-4vLinks
GPT-4V(ision) module for use with Autodistill.
β25Updated last year
Alternatives and similar repositories for autodistill-gpt-4v
Users that are interested in autodistill-gpt-4v are comparing it to the libraries listed below
Sorting:
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.β32Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈβ87Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, modelβ¦β37Updated 2 years ago
- β20Updated last year
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within secondsβ134Updated 3 weeks ago
- Notebooks using the Neural Magic libraries πβ39Updated last year
- Finetune any model on HF in less than 30 secondsβ55Updated last week
- EdgeSAM model for use with Autodistill.β29Updated last year
- Explore the use of DSPy for extracting features from PDFs πβ47Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.β65Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zetaβ16Updated 11 months ago
- β50Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β50Updated last year
- β18Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β84Updated last year
- π Unstructured Data Connectors for Haystack 2.0β17Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 11 months ago
- Large Language Model (LLM) Inference API and Chatbotβ126Updated last year
- Tools for merging pretrained large language models.β19Updated last year
- Streamlit app for recommending eval functions using prompt diffsβ29Updated last year
- β80Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.β36Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.β16Updated 2 years ago
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50Updated 2 years ago
- Not financial advice.β27Updated 2 years ago
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.β18Updated last year
- The Next Generation Multi-Modality Superintelligenceβ69Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β76Updated 2 years ago
- Data extraction with LLM on CPUβ112Updated last year
- AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questionsβ248Updated 10 months ago