CLUEbenchmark / SuperCLUE-ImageLinks
中文原生文生图测评基准
☆9Updated 11 months ago
Alternatives and similar repositories for SuperCLUE-Image
Users that are interested in SuperCLUE-Image are comparing it to the libraries listed below
Sorting:
- 中文原生多层次文生视频测评基准☆17Updated 11 months ago
- Our 2nd-gen LMM☆33Updated last year
- 【AIGC 实战入门笔记 —— AIGC 摩天大楼 】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语…☆14Updated 2 months ago
- Chinese CLIP models with SOTA performance.☆55Updated last year
- ☆28Updated last year
- CLIP中文encoder☆22Updated 3 years ago
- Llama3开源模型中文版-全方位测评,基于SuperCLUE基准 | Llama3 Chinese Evaluation with SuperCLUE☆16Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆27Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 9 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated 9 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆39Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆13Updated last year
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆37Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated 11 months ago
- ☆19Updated 5 months ago
- Taiyi-Diffusion-XL训练代码☆22Updated last year
- ☆68Updated last year
- ☆29Updated 10 months ago
- ChatSD is designed to make image generation tasks easily☆20Updated 2 years ago
- [CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.☆19Updated 2 months ago
- KDD 2024 AQA competition 2nd place solution☆11Updated 11 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆23Updated last year
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- ☆32Updated 2 years ago
- ☆15Updated 5 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 7 months ago
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"☆18Updated 2 weeks ago
- breezedeus的各种分享☆22Updated 2 years ago
- A light proxy solution for HuggingFace hub.☆47Updated last year