xiteng01 / CVPR2023_foundation_model_Track1
Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)
☆18Updated last year
Alternatives and similar repositories for CVPR2023_foundation_model_Track1:
Users that are interested in CVPR2023_foundation_model_Track1 are comparing it to the libraries listed below
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated last year
- Large Multimodal Model☆14Updated 9 months ago
- ☆40Updated last year
- Building a VLM model starts from the basic module.☆11Updated 9 months ago
- ☆29Updated 3 years ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆16Updated 3 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 3 months ago
- 1st solution for the Webly-supervised Fine-grained Recognition competition in https://www.cvmart.net/race/10412/base☆34Updated last year
- ☆19Updated 5 months ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆32Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- Code for weakly supervised segmentation of a single class☆17Updated 4 years ago
- 使用DBNet检测条形码,包含C++和Python两种版本的程序☆35Updated 3 years ago
- ☆67Updated last year
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆34Updated 2 years ago
- Multi-label classification based on timm, and add SimCLR to timm.☆37Updated 3 years ago
- Chinese CLIP models with SOTA performance.☆51Updated last year
- [ECCV 2020 Workshop] VIPirios Object Detection Champion☆43Updated last year
- 可以成功Lora微调的Qwen-VL模型☆18Updated last year
- DCIC22数字中国22-牛只图像分割竞赛第四名方案☆14Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Updated 3 years ago
- Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021☆56Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 4 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆11Updated 10 months ago
- ☆21Updated 4 years ago