roboflow / maestro
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
☆2,556 · Updated last week
Alternatives and similar repositories for maestro
Users who are interested in maestro are comparing it to the libraries listed below.
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ☆1,431 · Updated 2 weeks ago
- 4M: Massively Multimodal Masked Modeling ☆1,721 · Updated 2 months ago
- Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] ☆616 · Updated last year
- ☆2,939 · Updated 8 months ago
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API ☆1,680 · Updated 4 months ago
- Turn any computer or edge device into a command center for your computer vision projects. ☆1,666 · Updated last week
- TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. ☆2,524 · Updated last month
- Everything about the SmolLM2 and SmolVLM family of models ☆2,287 · Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,442 · Updated 3 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,401 · Updated 5 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. [Paper + Code + Demo] ☆718 · Updated 10 months ago
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning ☆1,984 · Updated this week
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects. ☆1,281 · Updated 3 weeks ago
- Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more. ☆3,099 · Updated this week
- Images to inference with no labeling (use foundation models to train supervised models). ☆2,254 · Updated this week
- Knowledge Agents and Management in the Cloud ☆3,967 · Updated last week
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. ☆1,065 · Updated 3 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆2,985 · Updated 2 months ago
- ☆707 · Updated last year
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning. ☆2,066 · Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,972 · Updated last week
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆945 · Updated 2 weeks ago
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms ☆1,441 · Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆2,436 · Updated this week
- A framework for prompt tuning using Intent-based Prompt Calibration ☆2,507 · Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,700 · Updated this week
- PyTorch native post-training library ☆5,171 · Updated last week
- Implementation for Describe Anything: Detailed Localized Image and Video Captioning ☆1,065 · Updated last week
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs ☆1,380 · Updated 8 months ago
- MLE-Agent: Your intelligent companion for seamless AI engineering and research. Integrate with arxiv and paper with code to provide… ☆1,277 · Updated 3 weeks ago