UnicomAI / UniT2IXLLinks
☆57Updated 8 months ago
Alternatives and similar repositories for UniT2IXL
Users that are interested in UniT2IXL are comparing it to the libraries listed below
Sorting:
- ☆79Updated last year
- ☆350Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- hf-mirror-cli 使用国内镜像,无需配置开箱即用,快速下载hugingface上的模型☆141Updated 7 months ago
- ☆181Updated 7 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆251Updated last month
- ☆330Updated last week
- ☆239Updated 7 months ago
- 视频理解:千问视频多模态模型 & Dify☆64Updated last year
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆249Updated 3 weeks ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆365Updated 2 weeks ago
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆106Updated 2 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:面向金融行业的大模型)☆345Updated last month
- Taiyi-Diffusion-XL训练代码☆23Updated last year
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- 使用FastAPI+vLLM部署Qwen2.5☆22Updated 11 months ago
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆190Updated last month
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆63Updated this week
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆292Updated 2 months ago
- Chinese Stable Diffusion, zh SD,中文文生图, 中文SD,中文Stable Diffusion☆49Updated last year
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 10 months ago
- ☆124Updated last month
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated 11 months ago
- PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析☆62Updated 10 months ago
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆224Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆44Updated last week
- ☆20Updated last year
- HuggingFace 中文文档☆24Updated last year
- Chinese CLIP models with SOTA performance.☆58Updated 2 years ago
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆46Updated 4 months ago