CaraJ7 / MMSearch
[ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
☆431Updated 3 months ago
Alternatives and similar repositories for MMSearch:
Users that are interested in MMSearch are comparing it to the libraries listed below
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆270Updated 3 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆219Updated last week
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆232Updated 2 months ago
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆446Updated 2 weeks ago
- ☆239Updated 8 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆163Updated last week
- 🌐 WebWalker: Benchmarking LLMs in Web Traversal☆390Updated last week
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆462Updated last month
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 3 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆117Updated 2 months ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆144Updated 7 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆78Updated last month
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆144Updated last week
- Your first AI prompt engineer☆376Updated 6 months ago
- The simplest open-source implementation of perplexity.ai☆309Updated 3 months ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations☆193Updated 3 weeks ago
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors☆73Updated last week
- WebDesignAgent : Towards Effortless Website Creation☆250Updated 7 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆310Updated 2 weeks ago
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆247Updated 2 months ago
- Collect every awesome work about r1!☆356Updated last week
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 4 months ago
- Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆468Updated last month
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆162Updated 2 months ago
- Conversational Retrieval Evaluation Dataset☆100Updated 2 months ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆617Updated 2 months ago
- ☆156Updated 6 months ago
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆214Updated 5 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆209Updated 3 weeks ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆182Updated 2 weeks ago