ForJadeForest / ImageSearchLightningCLIP
Using distilled CLIP model to deploy the android device
☆19Updated last year
Alternatives and similar repositories for ImageSearchLightningCLIP:
Users that are interested in ImageSearchLightningCLIP are comparing it to the libraries listed below
- A demo for running quantized CLIP model (ViT-B/32) on Android.☆39Updated last year
- CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android☆237Updated last year
- CLIP中文encoder☆22Updated 2 years ago
- 基于PaddleSeg的ModNet算法实现人像抠图(安卓版demo)☆59Updated 3 years ago
- ☆56Updated last year
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆230Updated 11 months ago
- ☆67Updated last year
- ☆34Updated 10 months ago
- [NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)☆123Updated last year
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆80Updated this week
- Research Code for Multimodal-Cognition Team in Ant Group☆133Updated 6 months ago
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆19Updated 11 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆56Updated 2 months ago
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆76Updated 4 months ago
- The deployment of deep learning models for mobile platforms includes some common CV and NLP tasks.☆25Updated 2 months ago
- 移动端开源人像分割☆51Updated 3 years ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 4 months ago
- ☆11Updated 8 months ago
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆63Updated last year
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆104Updated 10 months ago
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆28Updated 2 months ago
- Traveling in the HarmonyOS World☆20Updated 8 months ago
- 基于TensorFlow 仿有道云笔记App端 文档扫描 功能☆12Updated 7 years ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆31Updated 3 months ago
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆74Updated 5 months ago
- ☆18Updated last year
- This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"☆100Updated this week
- ☆26Updated last year
- Chinese CLIP models with SOTA performance.☆53Updated last year
- segment anything model (SAM) infer by ncnn on Android mobile phone☆27Updated last year