xiteng01 / CVPR2023_foundation_model_Track1
Codebase for Track 1 of the 1st Foundation Model Challenge, Workshop on Foundation Model (Open TransMind v1.0)
☆18Updated last year
Alternatives and similar repositories for CVPR2023_foundation_model_Track1:
Users who are interested in CVPR2023_foundation_model_Track1 are comparing it to the repositories listed below
- Building a VLM, starting from the basic modules.☆12Updated 10 months ago
- A curated collection of news on data competitions in China and abroad☆18Updated 3 years ago
- Large Multimodal Model☆14Updated 10 months ago
- ☆20Updated 5 months ago
- Code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face reading) recognition on human faces☆13Updated last year
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆12Updated 2 months ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆32Updated last year
- 1st-place solution for the Webly-supervised Fine-grained Recognition competition at https://www.cvmart.net/race/10412/base☆34Updated last year
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆16Updated 4 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 2 years ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 4 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated last year
- ☆28Updated 2 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆103Updated 10 months ago
- 4th-place solution for the DCIC22 (Digital China 2022) cattle image segmentation competition☆14Updated 2 years ago
- ☆41Updated last year
- This repo holds the competitions (information, solutions, summaries, memories) that our team has participated in☆25Updated last year
- ☆40Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆39Updated 2 months ago
- Barcode detection using DBNet, with both C++ and Python versions of the program☆35Updated 3 years ago
- Tensorflow implementation for Dash☆29Updated 2 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆11Updated last year
- ☆16Updated 11 months ago
- ☆38Updated 2 years ago
- [MM2023] Official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Updated last year
- ☆18Updated last year
- Recent OCR and related works on PaddlePaddle 2.0☆12Updated 3 years ago
- ☆25Updated 3 years ago
- ☆17Updated 2 years ago