mindspore-lab / mindface
MindFace is an open source toolkit based on MindSpore, containing the most advanced face recognition and detection models, such as ArcFace, RetinaFace and other models
☆46Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mindface
- A toolbox of vision models and algorithms based on MindSpore☆237Updated 2 weeks ago
- A toolbox of yolo models and algorithms based on MindSpore☆101Updated this week
- A toolbox of ocr models and algorithms based on MindSpore☆216Updated this week
- Multimodal chatbot with computer vision capabilities integrated☆98Updated 5 months ago
- A collection of diffusion models based on MindSpore☆158Updated 9 months ago
- ☆73Updated last year
- one for all, Optimal generator with No Exception☆364Updated this week
- Research Code for Multimodal-Cognition Team in Ant Group☆121Updated 3 months ago
- mllm-npu: training multimodal large language models on Ascend NPUs☆83Updated 2 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆63Updated last month
- ☆33Updated 3 weeks ago
- ☆156Updated 8 months ago
- Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pret…☆351Updated this week
- ☆85Updated 4 months ago
- SKU级别的商品图像数据集汇总☆38Updated last year
- ☆67Updated this week
- ☆66Updated last year
- 多模态 MM +Chat 合集☆204Updated this week
- run ChatGLM2-6B in BM1684X☆48Updated 8 months ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆20Updated 9 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆54Updated 2 weeks ago
- ☆77Updated 6 months ago
- Efficient Multimodal Large Language Models: A Survey☆269Updated 2 months ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆66Updated 3 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆51Updated last week
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 6 months ago
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆110Updated this week
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆32Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆10Updated 2 months ago