ForJadeForest / ImageSearchLightningCLIP

Using distilled CLIP model to deploy the android device

☆19

Alternatives and similar repositories for ImageSearchLightningCLIP:

Users that are interested in ImageSearchLightningCLIP are comparing it to the libraries listed below

greyovo / CLIP-android-demo
A demo for running quantized CLIP model (ViT-B/32) on Android.
☆42Updated last year
EdVince / CLIP-ImageSearch-NCNN
CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android
☆243Updated last year
DataXujing / DeepSeek-R1-Android
安卓手机部署DeepSeek-R1 蒸馏的1.5B模型
☆19Updated last month
slz929 / SAM-Android-ncnn
segment anything model (SAM) infer by ncnn on Android mobile phone
☆27Updated last year
applenob / clip_chinese_text_encoder
CLIP中文encoder
☆22Updated 2 years ago
om-ai-lab / OVDEval
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
☆45Updated 10 months ago
Ucas-HaoranWei / Vary-family
☆56Updated last year
RapidAI / RapidOcrAndroidOnnx
RapidOcr onnxruntime推理 for Android
☆69Updated last week
percent4 / multi-modal-image-search
本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。
☆20Updated last year
hlchen23 / ADPN-MM
Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…
☆49Updated last year
alipay / mobile-agent
☆40Updated last year
callsys / TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
☆26Updated last year
slz929 / mobileSAM-Android-MNN
☆28Updated last year
bzluan / TextCoT
The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.
☆38Updated 6 months ago
TencentARC-QQ / QA-CLIP
Chinese CLIP models with SOTA performance.
☆54Updated last year
om-ai-lab / GroundVLP
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
☆64Updated last year
Ucas-HaoranWei / Vary-tiny-600k
Vary-tiny codebase upon LAVIS （for training from scratch）and a PDF image-text pairs data (about 600k including English/Chinese)
☆79Updated 6 months ago
LinWeizheDragon / FLMR
The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.
☆82Updated 2 months ago
QQ-MM / Video-CCAM
A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.
☆68Updated 5 months ago
MikeDean2367 / AI-EYE
AI Eye的手机端的代码。可以实现视力检测、色盲检测、散光检测等，同时基于Mediapipe开发，实现了单目摄像头的测距和手势识别。This is the android version of app named "AI-Eye" using Mediapipe. It…
☆43Updated last year
qianbin1989228 / human_matting_android_demo
基于PaddleSeg的ModNet算法实现人像抠图（安卓版demo）
☆62Updated 3 years ago
xiteng01 / CVPR2023_foundation_model_Track1
Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)
☆18Updated 2 years ago
DataXujing / Qwen1.5-0.5b-chat-android
基于MNN-llm的安卓手机部署大语言模型：Qwen1.5-0.5B-Chat
☆72Updated 11 months ago
ZhangXJ199 / TinyLLaVA-Video
A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Based on TinyLLaVA_Factory.
☆46Updated last week
FeiGeChuanShu / ncnn_Android_matting
Android and Windows human matting demo infer by ncnn
☆58Updated 2 years ago
SY-Xuan / Pink
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
☆90Updated 2 months ago
DRSY / MoTIS
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
☆124Updated last year
Bryce-Hu / GestureCtrl
Android Studio基于mediapipe的手势控制
☆10Updated 5 years ago
TroyTzou / mlc-llm-android
参考自mlc-llm，个人尝试在android手机上部署大模型并运行
☆85Updated 7 months ago
alipay / Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
☆139Updated 8 months ago