UnicomAI / UniT2IXL
☆56Updated last year
Alternatives and similar repositories for UniT2IXL
Users that are interested in UniT2IXL are comparing it to the libraries listed below
- ☆187Updated last year
- ☆341Updated 4 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- hf-mirror-cli uses a mirror inside China to quickly download models from Hugging Face; works out of the box with no configuration☆151Updated last year
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆284Updated 4 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆270Updated 3 weeks ago
- Python3 package for Chinese/English OCR using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference based on the ppocr-v5-onnx model; open-source Chinese/English OCR…☆124Updated 2 weeks ago
- Deploying Qwen2.5 with FastAPI + vLLM☆25Updated last year
- ☆79Updated last year
- Code for using the ONNX version of DuGuang OCR in both Chinese and English☆56Updated 2 months ago
- ☆348Updated last year
- Training a LLaVA model with better Chinese support, with the training code and data open-sourced.☆79Updated last year
- GLM Series Edge Models☆158Updated 7 months ago
- Exploring deployment acceleration for the GOT-OCR project, not limited to any language☆62Updated last year
- PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.☆65Updated last year
- Delta-CoMe achieves near-lossless 1-bit compression; accepted at NeurIPS 2024☆58Updated last year
- ☆242Updated 11 months ago
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆241Updated 3 months ago
- Taiyi-Diffusion-XL training code☆23Updated last year
- Video understanding: Qwen video multimodal model & Dify☆66Updated last year
- ☆29Updated last year
- Qwen DianJin (通义点金): LLMs for the Financial Industry by Alibaba Cloud☆420Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46Updated 4 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆299Updated 7 months ago
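- Stitches the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combination☆526Updated 5 months ago
- An ecosystem of projects around large language models and multimodal models, mainly covering cross-modal search, speculative decoding, QAT quantization, multimodal quantization, ChatBot, and OCR☆197Updated last week
- The llava-Qwen2-7B-Instruct-Chinese-CLIP model, with enhanced Chinese text recognition and meme-meaning comprehension, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆27Updated last year
- PDF data: Chinese papers, securities documents, and financial reports☆36Updated last year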
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- Chinese CLIP models with SOTA performance.☆60Updated 2 years ago