xiteng01 / CVPR2023_foundation_model_Track1
Codebase for Track 1 of the 1st Foundation Model Challenge, Workshop on Foundation Model (Open TransMind v1.0)
☆18Updated 2 years ago
Alternatives and similar repositories for CVPR2023_foundation_model_Track1:
Users who are interested in CVPR2023_foundation_model_Track1 are comparing it to the repositories listed below
- Large Multimodal Model☆15Updated last year
- A curated roundup of domestic and international data competition news☆18Updated 3 years ago
- A subset of YFCC100M. Tools, verification scripts, and web-drive links for downloading the (uncompressed) datasets.☆19Updated 5 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆39Updated 7 months ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆16Updated 2 months ago
- ☆25Updated 8 months ago
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆43Updated 5 months ago
- ChineseCLIP using online learning☆13Updated 2 years ago
- ☆40Updated last year
- ☆20Updated last week
- 1st-place solution for the Webly-supervised Fine-grained Recognition competition at https://www.cvmart.net/race/10412/base☆34Updated 2 years ago
- This repo holds the competitions (information, solutions, summaries, memories) that our team has participated in☆26Updated last year
- [ECCV 2020 Workshop] VIPriors Object Detection Champion☆44Updated last year
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆56Updated 6 months ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆11Updated last year
- ☆16Updated 3 years ago
- Convert PaddleOCR to torchOCR: ppocr-v3, ppocr-v4, onnx, openvino☆32Updated last year
- Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face reading) recognition on faces☆13Updated last year
- DCIC22 (Digital China 2022): 4th-place solution for the cattle image segmentation competition☆14Updated 2 years ago
- Barcode detection using DBNet, with both C++ and Python versions of the program☆35Updated 3 years ago
- Code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- ☆56Updated last year
- ☆18Updated 2 years ago
- ☆14Updated last year
- ☆29Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 2 years ago
- DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection☆41Updated last year
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆103Updated last year
- ☆21Updated 2 years ago
- ☆18Updated 2 years ago