mindspore-lab / mindface
MindFace is an open-source toolkit based on MindSpore that contains state-of-the-art face recognition and detection models such as ArcFace and RetinaFace
☆46Updated 2 months ago
Alternatives and similar repositories for mindface
Users who are interested in mindface are comparing it to the libraries listed below
- A toolbox of vision models and algorithms based on MindSpore☆246Updated 3 weeks ago
- A toolbox of YOLO models and algorithms based on MindSpore☆127Updated last month
- A collection of diffusion models based on MindSpore☆162Updated last year
- ☆78Updated 2 years ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆122Updated 5 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆100Updated last year
- A toolbox of OCR models and algorithms based on MindSpore☆272Updated last month
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆60Updated 6 months ago
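- A multimodal large model implemented from scratch and named Reyes (睿视; R stands for 睿, eyes for 眼). Reyes has 8B parameters; its vision encoder is InternViT-300M-448px-V2_5 and its language model is Qwen2.5-7B-Instruct. Reyes also uses a two-layer MLP projection layer to connect…☆13Updated 3 months ago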
- one for all, Optimal generator with No Exception☆414Updated 2 weeks ago
- This project covers experiments and applications of Yi's multimodal model series, such as Yi-VL-6B/34B.☆13Updated last year
- ☆177Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆143Updated 10 months ago
- A Token-level Text Image Foundation Model for Document Understanding☆91Updated 2 weeks ago
- Run ChatGLM2-6B on BM1684X☆49Updated last year
- ☆56Updated last year
- ☆67Updated last year
- Chinese CLIP models with SOTA performance.☆55Updated last year
- ☆87Updated 10 months ago
- ☆28Updated last year
- ☆16Updated 2 years ago
- mllm-npu: training multimodal large language models on Ascend NPUs☆90Updated 8 months ago
- A Simple Framework of Small-scale LMMs for Video Understanding☆61Updated last week
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆90Updated 6 months ago
- Applications of large language models and multimodal models, mainly covering small models, agents, cross-modal search, OCR, RAG, chatbots, and more☆170Updated this week
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- A vLLM-accelerated implementation of GOT, combined with MinerU for PDF parsing in RAG☆56Updated 6 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆232Updated 2 months ago
- GUI for PaddleOCR whl based on Quicker☆12Updated 3 years ago
- For learning GOT/Qwen/OnnxLLm☆52Updated 7 months ago