xiteng01 / CVPR2023_foundation_model_Track1Links
Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)
☆18Updated 2 years ago
Alternatives and similar repositories for CVPR2023_foundation_model_Track1
Users that are interested in CVPR2023_foundation_model_Track1 are comparing it to the libraries listed below
Sorting:
- ☆30Updated last year
- Large Multimodal Model☆15Updated last year
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆19Updated 7 months ago
- Building a VLM model starts from the basic module.☆18Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆46Updated 10 months ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated 2 years ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆101Updated 11 months ago
- 第九届中国软件杯视频全量分析“一等奖”&第十届中国软件杯A2百度paddlepaddle跟踪赛道“二等奖”☆10Updated 2 years ago
- ☆40Updated last year
- ☆69Updated last year
- Toward Universal Multimodal Embedding☆60Updated 2 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- ChineseCLIP using online learning☆13Updated 2 years ago
- [ECCV 2020 Workshop] VIPirios Object Detection Champion☆44Updated 2 years ago
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- Knowledge Distillation Toolbox for Semantic Segmentation☆17Updated 2 years ago
- 基于MindSpore AI框架实现零售商品识别 top1方案☆45Updated 3 years ago
- Implementation for Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification☆56Updated 3 years ago
- Research Code for Multimodal-Cognition Team in Ant Group☆167Updated 3 months ago
- DCIC22数字中国22-牛只图像分割竞赛第四名方案☆14Updated 3 years ago
- [NeurIPS'22] Projector Ensemble Feature Distillation☆29Updated last year
- This repository lists some awesome public Open World object detection series projects.☆25Updated last year
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆42Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated 2 years ago
- ☆71Updated 2 years ago
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆20Updated 10 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆62Updated 11 months ago
- ☆57Updated last year
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Updated 2 years ago
- Pytorch、Numpy实现NMS、Soft-NMS代码☆12Updated 4 years ago