Experiments with LAVIS library to perform image2text and text2image retrieval with BLIP and BLIP2 models
☆15Sep 25, 2023Updated 2 years ago
Alternatives and similar repositories for image_text_retrieval_BLIP_BLIP2
Users that are interested in image_text_retrieval_BLIP_BLIP2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML2025 Oral] LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently☆32Oct 22, 2025Updated 7 months ago
- ☆15May 17, 2023Updated 3 years ago
- Pytorch implementation of BiomedCLIP vision model with LoRA tuning☆44May 18, 2023Updated 3 years ago
- Transform a 2D point distribution to a hex grid to avoid overplotting in data visualizations☆18Sep 4, 2024Updated last year
- 基于电商数据微调的Qwen2.5系列的电商大模型,电商数据sft后电商大模型。是https://github.com/leeguandong/EcommerceLLM的升级版本。qwen2.5的效果很好。☆13Oct 4, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- LV Volumes Prediction based on Multi-views Fusion CNN☆10Dec 10, 2020Updated 5 years ago
- Module providing brain MR images pre-processing workflows for Deep Learning.☆16May 12, 2026Updated last month
- 该项目旨在通过输入文本描述来检索与之相匹配的图片。☆43Aug 24, 2023Updated 2 years ago
- ☆20May 14, 2024Updated 2 years ago
- AI大模型的基本开发框架,适合普通后端程序员,功能类似coze包括:fastapi后端接口,搜索,文档解析和向量化,RPA和爬虫,自定义agent,对接第三方数据接口,mongodb数据库,控制json返回,多模态理解和生成等等☆13Jul 18, 2024Updated last year
- Learn how to create impactful AI Agents using Agno AI Python Package☆13Jul 31, 2025Updated 10 months ago
- 基于vllm部署qwen2.5_vl实现视频流的实时识别☆20Apr 1, 2025Updated last year
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated 2 years ago
- FetalNet: Multi-Task Deep Learning Framework for Fetal Ultrasound Biometric Measurements☆22Dec 14, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 特斯拉锁车搞笑段子合集☆27Jan 25, 2024Updated 2 years ago
- Cascading Feature Extraction for Fast Point Cloud Registration (BMVC 2021)☆17Dec 29, 2021Updated 4 years ago
- An unofficial pytorch implementation of the BiHDM model proposed by Yang et al. for decoding emotion from multi-channel EEG recordings, w…☆15Apr 6, 2023Updated 3 years ago
- 迪三教程代码☆16May 13, 2025Updated last year
- A vision-based RL environment for the Franka Panda arm using NVIDIA Isaac Sim☆19Jan 3, 2025Updated last year
- K means algorithm on the unit sphere☆13Sep 28, 2021Updated 4 years ago
- Multi-Head Attention, Transformer, Perceiver, Linear Attention.☆12Oct 24, 2023Updated 2 years ago
- Chinese BERT classification with tf2.0 and audio classification with mfcc☆14Dec 2, 2020Updated 5 years ago
- DoctorRAG is a medical AI that mimics doctor-like reasoning by combining textbook knowledge with insights from similar patient cases, usi…☆22May 21, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 长文本相似度模型☆21Nov 24, 2023Updated 2 years ago
- Official code repository for Med-TTT.☆19Jun 30, 2025Updated 11 months ago
- 这是一个用于与 RAGflow API 交互的 Python 客户端,支持数据集管理、文件管理、分块管理、聊天助手管理以及代理管理的完整功能。☆21Feb 21, 2025Updated last year
- ☆15Apr 22, 2024Updated 2 years ago
- Code repository for Rakuten Data Challenge: Multimodal Product Classification and Retrieval.☆30May 10, 2021Updated 5 years ago
- ☆24Apr 30, 2025Updated last year
- 中文CLIP:自定义数据集,可根据文图提取向量,实现文图匹配。☆22Sep 14, 2022Updated 3 years ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated last year
- A simple hashing, encoding, and generation library in C.☆13Mar 18, 2014Updated 12 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 大作业, 基于大语言模型和视觉模型的AI健身助手(后端)☆13Dec 21, 2023Updated 2 years ago
- In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Pro…☆11Aug 23, 2021Updated 4 years ago
- ☆15Jul 24, 2017Updated 8 years ago
- 使用django+pyecharts+PP-Human开发的动态数据大屏, 有人流数据的采集入库, 打架、摔倒等事件警报,口罩检测等实用功能。边缘端版本使用onnx推理提升效率,服务端版本支持视频流推拉☆33May 3, 2023Updated 3 years ago
- 同济大学班车app作弊器,你懂的。☆12Apr 7, 2015Updated 11 years ago
- 基于Amazon Bedrock的多模态AIGC童话绘本☆19Jan 5, 2024Updated 2 years ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆26Feb 1, 2024Updated 2 years ago