ForJadeForest / ImageSearchLightningCLIP
Using distilled CLIP model to deploy the android device
☆19Updated 2 years ago
Alternatives and similar repositories for ImageSearchLightningCLIP:
Users that are interested in ImageSearchLightningCLIP are comparing it to the libraries listed below
- A demo for running quantized CLIP model (ViT-B/32) on Android.☆42Updated last year
- CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android☆243Updated last year
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆19Updated last month
- segment anything model (SAM) infer by ncnn on Android mobile phone☆27Updated last year
- CLIP中文encoder☆22Updated 2 years ago
- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)☆45Updated 10 months ago
- ☆56Updated last year
- RapidOcr onnxruntime推理 for Android☆69Updated last week
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆20Updated last year
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆49Updated last year
- ☆40Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- ☆28Updated last year
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 6 months ago
- Chinese CLIP models with SOTA performance.☆54Updated last year
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆64Updated last year
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆79Updated 6 months ago
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆82Updated 2 months ago
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.☆68Updated 5 months ago
- AI Eye的手机端的代码。可以实现视力检测、色盲检测、散光检测等,同时基于Mediapipe开发,实现了单目摄像头的测距和手势识别。This is the android version of app named "AI-Eye" using Mediapipe. It…☆43Updated last year
- 基于PaddleSeg的ModNet算法实现人像抠图(安卓版demo)☆62Updated 3 years ago
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated 2 years ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆72Updated 11 months ago
- A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Based on TinyLLaVA_Factory.☆46Updated last week
- Android and Windows human matting demo infer by ncnn☆58Updated 2 years ago
- Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs☆90Updated 2 months ago
- [NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)☆124Updated last year
- Android Studio基于mediapipe的手势控制☆10Updated 5 years ago
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆85Updated 7 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆139Updated 8 months ago