roboflow / gpt-checkup
Monitor the performance of OpenAI's GPT-4V model over time.
☆27 Updated this week
Related projects:
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆75 Updated 11 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆33 Updated 11 months ago
- Summarize any arXiv paper with ease ☆60 Updated last year
- EdgeSAM model for use with Autodistill. ☆24 Updated 3 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant" ☆18 Updated last year
- ☆13 Updated 9 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆80 Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆54 Updated last month
- YOLOExplorer: Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds ☆119 Updated 2 weeks ago
- ☆55 Updated 3 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆64 Updated 9 months ago
- ☆99 Updated 3 weeks ago
- Simple CogVLM client script ☆14 Updated 9 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆11 Updated 7 months ago
- Flow Chart-2-Code ☆30 Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆26 Updated this week
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆65 Updated 4 months ago
- MetaCLIP module for use with Autodistill. ☆21 Updated 9 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆34 Updated 4 months ago
- ☆50 Updated last month
- Gradio UI for a Cog API ☆62 Updated 5 months ago
- GitHub repo for Peifeng's internship project ☆12 Updated 10 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit. ☆96 Updated last week
- Visual RAG using less than 300 lines of code. ☆22 Updated 6 months ago
- Enhancement in Multimodal Representation Learning. ☆38 Updated 6 months ago
- GPT-4V(ision) module for use with Autodistill. ☆25 Updated last month
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents" ☆32 Updated last week
- ☆38 Updated 4 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore ☆39 Updated last week
- YouTube Assistant ☆12 Updated last year