xiteng01 / CVPR2023_foundation_model_Track1
Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)
☆18Updated last year
Alternatives and similar repositories for CVPR2023_foundation_model_Track1:
Users who are interested in CVPR2023_foundation_model_Track1 are comparing it to the repositories listed below
- A roundup of domestic and international data competition news☆18Updated 3 years ago
- ☆21Updated 7 months ago
- Building a VLM model starts from the basic module.☆13Updated 11 months ago
- ☆40Updated last year
- Large Multimodal Model☆14Updated 11 months ago
- Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face reading) recognition on faces☆13Updated last year
- 1st solution for the Webly-supervised Fine-grained Recognition competition in https://www.cvmart.net/race/10412/base☆34Updated 2 years ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 5 months ago
- This repo holds the competitions (information, solutions, summaries, memories) that our team has participated in☆25Updated last year
- DCIC22 (Digital China 22) cattle image segmentation competition, 4th-place solution☆14Updated 2 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆16Updated last month
- Barcode detection with DBNet, including both C++ and Python versions of the program☆35Updated 3 years ago
- ☆16Updated last year
- This is the code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- ☆29Updated 3 years ago
- ☆27Updated 10 months ago
- General Image Classification Code base☆21Updated 3 years ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆32Updated last year
- ☆21Updated 2 years ago
- Cosine Annealing Warm Booting LR☆27Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 2 years ago
- ChineseCLIP using online learning☆12Updated 2 years ago
- [ECCV 2020 Workshop] VIPriors Object Detection Champion☆44Updated last year
- AIGCDetectBaseline☆11Updated 8 months ago
- Deploys DBNet text detection with opencv, including both C++ and Python implementations☆33Updated 3 years ago
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆39Updated 3 months ago
- ☆57Updated 2 years ago
- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆19Updated last year