Chinese CLIP models with SOTA performance.
☆62Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for QA-CLIP
Users that are interested in QA-CLIP are comparing it to the libraries listed below.
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago
- ☆72Jun 28, 2023Updated 2 years ago
- DeepVAC-FACE test dataset.☆14May 13, 2021Updated 5 years ago
- TagGPT: Large Language Models are Zero-shot Multimodal Taggers☆67May 12, 2023Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 4 years ago
- Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.☆16May 30, 2023Updated 3 years ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆51Dec 30, 2023Updated 2 years ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Mar 21, 2023Updated 3 years ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆108Oct 25, 2024Updated last year
- ☆57Jan 23, 2024Updated 2 years ago
- ☆90Jul 4, 2024Updated last year
- ☆24Aug 17, 2024Updated last year
- Implementation of "Realtime Multi Person Pose-Estimation" in PyTorch with data from AI Challenger☆13Nov 24, 2017Updated 8 years ago
- ☆11May 30, 2023Updated 3 years ago
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆306Jan 8, 2024Updated 2 years ago
- Python-caffe simplified implementation of the R-CNN object detection method. I have taken as a starting point the caffe ipython-notebook …☆19Feb 1, 2015Updated 11 years ago
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆41Jan 4, 2024Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,932Mar 31, 2026Updated 2 months ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- Accelerate Segment Anything Model inference using TensorRT 8.6.1.6☆107Oct 20, 2023Updated 2 years ago
- PyTorch implementation of AdvancedEast+mobilenetv3☆26Dec 25, 2019Updated 6 years ago
- 💻NUAA 2018 operating systems mini-assignment: a simulated memory allocation program (BF algorithm)☆13Jul 2, 2018Updated 7 years ago
- ☆21Feb 29, 2024Updated 2 years ago
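- [WIP@Oct 13] 质衡 benchmark (Q-Bench in Chinese), including Chinese versions of the [low-level visual question answering] and [low-level visual description] datasets, as well as image quality assessment under Chinese prompts. We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago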
- CycleCenternet based on MMDetection☆22Jun 28, 2023Updated 2 years ago
- Our 2nd-gen LMM☆34May 22, 2024Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 3 years ago
- An awesome A* algorithm, the best pathfinding algorithm☆12Feb 28, 2020Updated 6 years ago
- OCR data, detection data, recognition data☆29Mar 24, 2020Updated 6 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆22Jun 28, 2024Updated last year
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆17Jun 20, 2023Updated 2 years ago
- https://arxiv.org/abs/2005.10497☆72Aug 11, 2020Updated 5 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆42Feb 21, 2023Updated 3 years ago
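- A MATLAB GUI-based path-planning application for seabed exploration by autonomous underwater vehicles (AUVs). It integrates intelligent path generation, Dubins curve optimization, and obstacle avoidance, and supports real-time data visualization and integration with AUV control systems.☆25Oct 24, 2025Updated 7 months ago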
- 🔥[Information Fusion 2024, Official Code] for paper "Prompt-guided image color aesthetics assessment: Models, datasets and benchmarks". …☆69Jul 29, 2025Updated 10 months ago
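- An open-source, commercially usable multimodal model that supports bilingual Chinese-English vision-text dialogue.☆379Sep 23, 2023Updated 2 years ago
- Four heuristic algorithms (simulated annealing, genetic algorithm, tabu search, ant colony optimization) for solving TSP (traveling salesman problem) instances☆15Dec 27, 2019Updated 6 years ago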