CaraJ7 / MMSearch
The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
☆407Updated last month
Alternatives and similar repositories for MMSearch:
Users that are interested in MMSearch are comparing it to the libraries listed below
- PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides: https://arxiv.org/abs/2501.03936☆196Updated this week
- 1st User Profile-Based Memory for GenAI Apps☆133Updated this week
- The simplest open-source implementation of perplexity.ai☆279Updated 4 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆194Updated 2 weeks ago
- Your first AI prompt engineer☆357Updated 2 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆103Updated this week
- ☆368Updated last month
- WebDesignAgent : Towards Effortless Website Creation☆243Updated 3 months ago
- A LLM-based Agent that predict its tasks proactively.☆277Updated last week
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆178Updated this week
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆231Updated last week
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆205Updated this week
- ☆255Updated last month
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆174Updated last month
- An open-sourced end-to-end VLM-based GUI Agent☆513Updated last week
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 4 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆43Updated this week
- An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos…☆519Updated 6 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆163Updated 2 weeks ago
- Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completio…☆74Updated last month
- Medical o1, Towards medical complex reasoning with LLMs☆656Updated last week
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆172Updated 4 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆89Updated last month
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆214Updated 3 weeks ago
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)☆822Updated 6 months ago
- Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-…☆251Updated 6 months ago
- Parsing-free RAG supported by VLMs☆552Updated this week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆451Updated 2 weeks ago
- ☆356Updated last month