sugarandgugu / Text2Image-Retrieval
Computer vision course project: an image-text retrieval system based on Chinese-CLIP
☆99Updated 2 years ago
Alternatives and similar repositories for Text2Image-Retrieval
Users that are interested in Text2Image-Retrieval are comparing it to the libraries listed below
- This project aims to retrieve images that match an input text description.☆43Updated 2 years ago
- Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023☆92Updated 8 months ago
- Build a simple, basic multimodal large model from scratch. 🤖☆47Updated last year
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…☆558Updated 9 months ago
- Chinese CLIP: supports custom datasets and extracts vectors from text and images for text-image matching.☆22Updated 3 years ago
- An LLM-based tool to chat with your documents and databases, including a management system | An LLM knowledge-base question-answering system for internal enterprise environments, including an admin back end☆111Updated 2 years ago
- Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024☆106Updated 6 months ago
- Code for AAAI 2024 paper: Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects☆159Updated 10 months ago
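- 2024.06.19 This project uses Chinese-CLIP to build text-to-image and image-to-image search pages, aiming to help users get started quickly with cross-modal retrieval tasks. The code uses about 190k images (189,585) from the MUGE dataset as the gallery. The project provides feature extraction, retrieval, and UI code.☆20Updated last year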
- [EMNLP 2023] FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models☆93Updated 2 years ago
- The easiest-to-pick-up, zero-barrier chatglm3 & agent & langchain project☆233Updated last year
- Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection.☆105Updated last year
- Chinese large language model☆123Updated 2 years ago
- Chinese llama2, from pretraining to reinforcement learning☆87Updated 2 years ago
- [ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆187Updated 3 months ago
- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆28Updated last year
- Fine-tuning and deployment of DeepSeek-R1-Distill-Qwen-32B on 2 million medical data records☆162Updated 10 months ago
- Object detection using yolov8 as the baseline model and VisDrone2019 as the dataset, with the author's own improvement strategies☆120Updated last year
- Simple code demos about classic AIGC models/Compilation of blogs and papers on classic AIGC models.☆108Updated last year
- YiJian-Community: a full-process automated large model safety evaluation tool designed for academic research☆113Updated 3 weeks ago
- [AAAI'24 Oral] LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network☆47Updated last month
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model☆134Updated 8 months ago
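- Train an LLM from scratch with deepspeed through the pretrain and sft stages, verifying the LLM's ability to learn knowledge, understand language, and answer questions☆159Updated 2 months ago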
- Building a VLM model starts from the basic module.☆18Updated last year
- Graduation project: "Design and Implementation of Video-Text Retrieval Based on the CLIP Model"☆16Updated last year
- Detect known and unknown objects in the open world (a new detector able to distinguish known from unknown objects)☆86Updated 2 years ago
- Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+…☆161Updated last month
- Text classification with bert, roberta, ernie, and other methods☆88Updated 2 years ago
- ☆245Updated last year
- [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization☆582Updated last year