Flode-Labs / auto-labelerLinks
Label your images using GPT-4!
☆18Updated 2 years ago
Alternatives and similar repositories for auto-labeler
Users that are interested in auto-labeler are comparing it to the libraries listed below
Sorting:
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Updated last year
- Data extraction with LLM on CPU☆270Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆97Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆99Updated this week
- A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large …☆120Updated 2 years ago
- ☆75Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆77Updated 2 years ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated 2 years ago
- GPT-4V(ision) module for use with Autodistill.☆25Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆165Updated 6 months ago
- AI narrator☆15Updated 2 years ago
- ☆157Updated 2 years ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆33Updated last year
- This repo is a packaged version of the Yolov9 model.☆87Updated 2 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- ☆25Updated 2 years ago
- Using the moondream VLM with optical flow for promptable object tracking☆73Updated 11 months ago
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆21Updated 2 years ago
- ☆127Updated 10 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated 2 years ago
- ☆29Updated 2 years ago
- This repo contains codes covered in the youtube tutorials.☆87Updated 8 months ago
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- Simple CogVLM client script☆14Updated 2 years ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Updated 2 years ago
- ☆82Updated last year