CaraJ7 / MMSearch
[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs
☆414 Updated 3 weeks ago
Alternatives and similar repositories for MMSearch:
Users who are interested in MMSearch are comparing it to the libraries listed below:
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization" ☆234 Updated 3 weeks ago
- 🌐 WebWalker: Benchmarking LLMs in Web Traversal ☆318 Updated 2 weeks ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆121 Updated this week
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought ☆207 Updated last month
- 🍎 APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs. ☆237 Updated last week
- WebDesignAgent: Towards Effortless Website Creation ☆246 Updated 5 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data. ☆212 Updated last week
- Your first AI prompt engineer ☆361 Updated 3 months ago
- The simplest open-source implementation of perplexity.ai ☆295 Updated 3 weeks ago
- An LLM-based agent that predicts its tasks proactively. ☆299 Updated last month
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆390 Updated last week
- ☆393 Updated this week
- An open-sourced end-to-end VLM-based GUI Agent ☆753 Updated this week
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization ☆186 Updated this week
- Semantic Search on Wikipedia with Upstash Vector ☆447 Updated last month
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆223 Updated 2 weeks ago
- ☆170 Updated 2 weeks ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts ☆43 Updated last month
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA ☆465 Updated last month
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆829 Updated 7 months ago
- Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completio… ☆78 Updated 2 months ago
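- Use free LLM APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory. Synthetic data ☆129 Updated 2 months ago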
- Trans Router ☆152 Updated last month
- Profile-Based Long-Term Memory for AI Applications ☆551 Updated this week
- Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-… ☆262 Updated 8 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement" ☆45 Updated 6 months ago