voyage-ai / voyage-multimodal-3Links
☆18Updated 7 months ago
Alternatives and similar repositories for voyage-multimodal-3
Users that are interested in voyage-multimodal-3 are comparing it to the libraries listed below
Sorting:
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆25Updated last week
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated 8 months ago
- ☆29Updated 9 months ago
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking☆47Updated 3 months ago
- The open source implementation of "AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model"☆22Updated 4 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆19Updated 2 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 8 months ago
- GLM Series Edge Models☆142Updated 3 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- ☆35Updated 6 months ago
- ☆55Updated 6 months ago
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Updated last year
- Hybrid-RAG is a hybrid Retrieval-Augmented Generation (RAG) model that leverages BERT for retrieving relevant documents and GPT-2 for gen…☆29Updated 4 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆125Updated 7 months ago
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automati…☆23Updated last month
- ☆68Updated 8 months ago
- ☆13Updated 2 years ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆13Updated 3 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆22Updated last month
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Updated 6 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 6 months ago
- Open source intent recognition framework powered by LLMs.☆19Updated 5 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆12Updated 3 months ago
- ☆32Updated 4 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 3 months ago