cognitedata / Qwen-VL-finetuneLinks
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
☆10Updated last year
Alternatives and similar repositories for Qwen-VL-finetune
Users that are interested in Qwen-VL-finetune are comparing it to the libraries listed below
Sorting:
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- SAM-CLIP module for use with Autodistill.☆15Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 6 months ago
- Download flickr8k, flickr30k image caption datasets☆23Updated last year
- Timm model explorer☆39Updated last year
- PyTorch, PyTorch Lightning framework for trying knowledge distillation in image classification problems☆32Updated 10 months ago
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆93Updated 5 months ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated last month
- Fine tuning OpenAI's CLIP model on Indian Fashion Dataset☆50Updated 2 years ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆25Updated 4 months ago
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆25Updated 8 months ago
- ViT trained on COYO-Labeled-300M dataset☆32Updated 2 years ago
- EdgeSAM model for use with Autodistill.☆26Updated 11 months ago
- Playground for Transformers☆52Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- ☆13Updated 2 years ago
- Implementing DropPath/StochasticDepth in PyTorch☆16Updated 3 years ago
- Pytorch implementation of Light FlowNet☆15Updated 5 years ago
- code for the ddp tutorial☆32Updated 3 years ago
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆22Updated 3 years ago
- This Repository demostrates various examples using YOLO☆13Updated last year
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆21Updated 3 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated 10 months ago
- Simple image classification for custom dataset (pytorch-lightning, timm)☆27Updated 2 years ago
- PyTorch implementation of Teacher-Student Network(Knowledge Distillation).☆26Updated 3 years ago