SkalskiP / fashion-assistant

Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from images. We then pass the prompt, along with the extracted features, to an LLM, which allows for advanced queries over image datasets.
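
The step of passing the prompt together with the extracted features to an LLM could look roughly like the sketch below. It is only an illustration, not the repository's actual code: the `ImageFeatures` schema, the `build_prompt` helper, and the sample values are all hypothetical, and the feature extraction itself (YOLO, CLIP, DINOv2) is assumed to have already run. DINOv2 embeddings are left out on the assumption that raw vectors are not useful as prompt text.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class ImageFeatures:
    """Hypothetical per-image summary; not the repository's actual schema."""
    image_id: str
    detected_items: list[str]  # e.g. class labels from an object detector such as YOLO
    style_tags: list[str]      # e.g. labels from CLIP zero-shot classification


def build_prompt(question: str, features: list[ImageFeatures]) -> str:
    """Combine the user's question with the extracted features into one prompt string."""
    records = [asdict(f) for f in features]
    return (
        "You are given features extracted from an image dataset as JSON.\n"
        f"{json.dumps(records, indent=2)}\n\n"
        f"Question: {question}\n"
        "Answer using only the image_id values listed above."
    )


# Made-up sample data standing in for real extractor output.
features = [
    ImageFeatures("img_001", ["jacket", "jeans"], ["casual", "denim"]),
    ImageFeatures("img_002", ["dress"], ["formal", "red"]),
]
prompt = build_prompt("Which images show formal outfits?", features)
print(prompt)
```

The resulting string can be sent to whichever LLM client is in use. Serializing the features as JSON keeps the prompt compact and lets the model refer back to specific `image_id` values in its answer.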
