UnicomAI / UniT2IXLLinks
☆57Updated 9 months ago
Alternatives and similar repositories for UniT2IXL
Users that are interested in UniT2IXL are comparing it to the libraries listed below
Sorting:
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- hf-mirror-cli 使用国内镜像,无需配置开箱即用,快速下载hugingface上的模型☆141Updated 8 months ago
- ☆79Updated last year
- ☆337Updated last week
- ☆240Updated 7 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆252Updated 2 months ago
- ☆186Updated 8 months ago
- ☆349Updated last year
- 视频理解:千问视频多模态模型 & Dify☆65Updated last year
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆260Updated 3 weeks ago
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆106Updated 3 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated 11 months ago
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- Chinese CLIP models with SOTA performance.☆58Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆45Updated last month
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 11 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金: 阿里云金融大模型)☆366Updated last week
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆226Updated 4 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆292Updated 3 months ago
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆49Updated 4 months ago
- project page for ChatAnyone☆114Updated 6 months ago
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆210Updated this week
- PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析☆63Updated 11 months ago
- 视频分类标注、视频时空标注☆42Updated 2 years ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆25Updated last year
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆64Updated 3 weeks ago
- Taiyi-Diffusion-XL训练代码☆23Updated last year
- ☆81Updated 2 years ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆140Updated 8 months ago
- An easy use face swap tool for images and tools only depend on onnxruntime.☆78Updated last year