UnicomAI / UniT2IXLLinks
☆57Updated last year
Alternatives and similar repositories for UniT2IXL
Users that are interested in UniT2IXL are comparing it to the libraries listed below
Sorting:
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- ☆187Updated 11 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆269Updated last week
- ☆341Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46Updated 4 months ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆283Updated 4 months ago
- ☆79Updated last year
- ☆348Updated last year
- ☆242Updated 11 months ago
- PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.☆65Updated last year
- hf-mirror-cli 使用国内镜像,无需配置开箱即用,快速下载hugingface上的模型☆151Updated 11 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆58Updated last year
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆125Updated this week
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆196Updated 5 months ago
- 视频理解:千问视频多模态模型 & Dify☆66Updated last year
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year
- GLM Series Edge Models☆156Updated 7 months ago
- Chinese CLIP models with SOTA performance.☆60Updated 2 years ago
- 使用FastAPI+vLLM部署Qwen2.5☆25Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated 8 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆299Updated 7 months ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆509Updated 4 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated last year
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆54Updated 2 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散 模型支持多语言能力)☆146Updated last year
- 视频分类标注、视频时空标注☆44Updated 2 years ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:阿里云金融大模型)☆418Updated last week
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆242Updated last month
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆241Updated 2 months ago
- HuggingFace 中文文档☆25Updated last year