voyage-ai / voyage-multimodal-3
☆16Updated 4 months ago
Alternatives and similar repositories for voyage-multimodal-3:
Users that are interested in voyage-multimodal-3 are comparing it to the libraries listed below
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 6 months ago
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆20Updated last year
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆14Updated 10 months ago
- ☆21Updated 7 months ago
- MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval☆131Updated 2 weeks ago
- Xtuner Factory☆33Updated last year
- ☆78Updated 10 months ago
- ☆29Updated 7 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆22Updated 8 months ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆49Updated last year
- Search, organize, discover anything!☆48Updated 11 months ago
- Chinese CLIP models with SOTA performance.☆54Updated last year
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆82Updated 2 months ago
- ☆56Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆34Updated 5 months ago
- Our 2nd-gen LMM☆33Updated 10 months ago
- ☆26Updated 5 months ago
- GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析☆52Updated 4 months ago
- ThinkLLM:大语言模型算法与组件实现☆27Updated last week
- ☆17Updated 9 months ago
- A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Based on TinyLLaVA_Factory.☆46Updated 2 weeks ago
- GLM Series Edge Models☆131Updated last month
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- ☆67Updated last year
- ☆18Updated last month
- SUS-Chat: Instruction tuning done right☆48Updated last year
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆171Updated last month
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆285Updated 2 weeks ago
- 通过阿里云盘,colab,国内下载huggingface大模型轻轻松松☆36Updated 9 months ago
- ☆31Updated 2 months ago