Flode-Labs / auto-labeler
Label your images using GPT-4!
☆18Updated last year
Alternatives and similar repositories for auto-labeler
Users who are interested in auto-labeler are comparing it to the libraries listed below.
- A real-time video caption-to-conversation bot that captures frames, generates captions, and creates conversational responses using a Large …☆122Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- AI narrator☆14Updated 2 years ago
- ☆17Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ☆76Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- Data extraction with LLM on CPU☆269Updated last year
- Transcribe and summarize videos using Whisper and LLMs on the Apple MLX framework☆76Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆65Updated 2 years ago
- ☆29Updated 2 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated 2 years ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Not financial advice.☆28Updated 2 years ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆92Updated last week
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Updated last year
- RAG tool using Haystack, Mistral, and Chainlit. Fully open-source stack on CPU.☆24Updated 2 years ago
- Democratizing Function Calling Capabilities for Open-Source Language Models☆41Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆33Updated last year
- ☆30Updated last year
- Chat to Compose Video☆197Updated last year
- VLM-driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆125Updated 5 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated 2 years ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated last year
- ☆22Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago