taishan1994 / MiniClip
Train a simple CLIP model hands-on to deepen your understanding of CLIP.
☆22Updated 7 months ago
Alternatives and similar repositories for MiniClip
Users that are interested in MiniClip are comparing it to the libraries listed below
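- A collection of multimodal (MM) + Chat resources☆280Updated 4 months ago
- PyTorch distributed training framework☆84Updated last month
- Splices the SmolVLM2 vision head onto the Qwen3-0.6B model and fine-tunes the combination☆496Updated 4 months ago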
- README.md☆48Updated 2 years ago
- ☆57Updated 2 years ago
- ☆31Updated last year
- A toolbox of yolo models and algorithms based on MindSpore☆170Updated last month
- ☆71Updated 2 years ago
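- An ecosystem of large language models and multimodal models, mainly covering cross-modal search, speculative decoding, QAT quantization, multimodal quantization, ChatBot, and OCR☆194Updated 4 months ago
- Building a VLM, starting from the basic modules.☆18Updated last year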
- async inference for machine learning model☆26Updated 3 years ago
- DINOv3 training example☆118Updated last month
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆94Updated 9 months ago
- Fine-tuning Grounding DINO☆153Updated 5 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆108Updated last year
- [ArXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆128Updated 7 months ago
- The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".☆117Updated last year
- Trains a LLaVA model with better Chinese support and open-sources the training code and data.☆77Updated last year
- ☆33Updated 10 months ago
- This project showcases the deployment of the RT-DETR model using ONNXRUNTIME in C++ and Python.☆58Updated 2 years ago
- A toolbox of vision models and algorithms based on MindSpore☆265Updated 5 months ago
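- A from-scratch VLM fine-tuning implementation built without any framework (including pre-training and SFT)☆35Updated 4 months ago
- A clip-pytorch model that can be trained on your own dataset.☆247Updated 2 years ago
- A summary of YOLOv5 TensorRT INT8 quantization methods☆85Updated 2 years ago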
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…☆105Updated last year
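- Runner-up solution of TensorRT 2022: accelerating the MobileViT model with TensorRT☆68Updated 3 years ago
- Efficient deployment: TensorRT inference for YOLO X, V3, V4, V5, V6, V7, V8, and EdgeYOLO ™️, with pre- and post-processing implemented as CUDA kernels. CPP/CUDA🚀☆53Updated 2 years ago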
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆172Updated 11 months ago
- 🔨🔨🔨 (mmplot) Used to draw graphs of multiple metrics, such as the accuracy and speed of multiple deep learning models.☆87Updated last year
- ☆42Updated 11 months ago