roboflow / gpt-checkup
Monitor the performance of OpenAI's GPT-4V model over time.
β31Updated this week
Related projects β
Alternatives and complementary repositories for gpt-checkup
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈβ77Updated last year
- β20Updated 5 months ago
- Summarize any Arixv Paper with easeβ60Updated last year
- β13Updated 11 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"β18Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, modelβ¦β34Updated last year
- EdgeSAM model for use with Autodistill.β25Updated 5 months ago
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.β70Updated 4 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.β59Updated 3 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.β108Updated 2 months ago
- Templates for computer vision projects, referenced in Roboflow blog posts.β18Updated 7 months ago
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826β52Updated last month
- MetaCLIP module for use with Autodistill.β21Updated 11 months ago
- How to use bounding boxes with the Gemini APIβ90Updated 4 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.β65Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β77Updated 5 months ago
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimationβ45Updated last month
- OpenAI's Realtime API minus the enterprise bloatβ21Updated last week
- β60Updated last year
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNeβ¦β35Updated 6 months ago
- β41Updated 5 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Modelsβ44Updated last year
- Enhancement in Multimodal Representation Learning.β39Updated 8 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.β45Updated 3 months ago
- Passively collect images for computer vision datasets on the edge.β26Updated last year
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timmβ¦β119Updated this week
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.β92Updated 3 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to eβ¦β22Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ49Updated this week