mindspore-lab / mindface
MindFace is an open source toolkit based on MindSpore, containing the most advanced face recognition and detection models, such as ArcFace, RetinaFace and other models
☆46Updated 2 months ago
Alternatives and similar repositories for mindface:
Users that are interested in mindface are comparing it to the libraries listed below
- A toolbox of vision models and algorithms based on MindSpore☆245Updated last week
- A toolbox of yolo models and algorithms based on MindSpore☆125Updated 3 weeks ago
- A collection of diffusion models based on MindSpore☆161Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆100Updated 11 months ago
- A toolbox of ocr models and algorithms based on MindSpore☆267Updated 3 weeks ago
- ☆78Updated 2 years ago
- one for all, Optimal generator with No Exception☆411Updated 2 weeks ago
- ☆78Updated 11 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆119Updated 5 months ago
- ☆173Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆142Updated 9 months ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Updated last year
- X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages☆310Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆60Updated 5 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆22Updated 9 months ago
- ☆104Updated last year
- ☆161Updated 2 weeks ago
- AAAI 2024: Visual Instruction Generation and Correction☆92Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆136Updated 3 months ago
- Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"☆145Updated 3 weeks ago
- mllm-npu: training multimodal large language models on Ascend NPUs☆91Updated 7 months ago
- ☆108Updated last year
- [ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model☆328Updated 5 months ago
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.☆272Updated last year
- Chinese CLIP models with SOTA performance.☆55Updated last year
- A Simple Framework of Small-scale LMMs for Video Understanding☆56Updated last week
- -☆21Updated 2 years ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆55Updated 2 years ago
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆13Updated last year
- ☆32Updated 2 years ago