Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision
☆320Jan 25, 2026Updated last month
Alternatives and similar repositories for Awesome-RAG-Vision
Users that are interested in Awesome-RAG-Vision are comparing it to the libraries listed below
Sorting:
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆1,051Jan 30, 2026Updated last month
- The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).☆22Aug 2, 2025Updated 7 months ago
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆123Dec 6, 2025Updated 3 months ago
- The official repo for "Unified Domain Adaptive Semantic Segmentation" (IEEE TPAMI 2025)☆33Aug 14, 2025Updated 6 months ago
- Fetch arxiv data to LLM-friendly text☆130Feb 18, 2026Updated 2 weeks ago
- Large Language Model in Action☆342Jan 28, 2025Updated last year
- 基于chatgpt-next-web 增强版本,后台管理,接入知识库等。将按需持续接入midjourney绘画功能,接入了stable-diffusion,支持oss,支持dall-e-3、gpt-4-vision-preview、whisper、tts,支持gpt-4-a…☆38May 4, 2024Updated last year
- Reading list for multimodal sequence learning☆14Sep 4, 2023Updated 2 years ago
- ☆169Oct 31, 2024Updated last year
- ☆30Jul 21, 2025Updated 7 months ago
- Focused Papers, Delivered Simply :)☆51Dec 25, 2025Updated 2 months ago
- This is the official repository for the paper titled "Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a M…☆16Apr 29, 2025Updated 10 months ago
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆33Dec 21, 2023Updated 2 years ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆45Sep 27, 2025Updated 5 months ago
- Automated skill creation workshop for Claude Code☆38Nov 14, 2025Updated 3 months ago
- ☆13Feb 19, 2025Updated last year
- ☆36Sep 25, 2024Updated last year
- The official GitHub page for the survey paper "A Survey on LLM Symbolic Reasoning". And this paper is under review.☆23Feb 15, 2026Updated 3 weeks ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆14Nov 20, 2025Updated 3 months ago
- AI 味去除 - 仅在 Gemini 2.5 Pro 上测试通过☆923Apr 2, 2025Updated 11 months ago
- ☆42Nov 19, 2025Updated 3 months ago
- ☆15Aug 20, 2024Updated last year
- ☆499Oct 11, 2025Updated 4 months ago
- ☆51Apr 11, 2025Updated 10 months ago
- Learn about the fundamentals of LangGraph through a series of notebooks☆331Updated this week
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆26Dec 19, 2025Updated 2 months ago
- Elaina is a wavefront implementation of walk on stars. (Code for SIGGRAPH 2025 paper "Guiding-Based Importance Sampling for Walk on Stars…☆27Oct 7, 2025Updated 5 months ago
- 遇事不决,Vibe 力学! One-Person Company AI Tools Series – continuously updated to help boost productivity and empower your solo business!☆2,656May 8, 2025Updated 10 months ago
- Reproduction of DeepSeek-R1☆242Apr 14, 2025Updated 10 months ago
- PDF2MD是一个高效的PDF到Markdown转换工具,旨在帮助用户轻松将PDF文档转换为Markdown格式,便于编辑、分享和发布。通过简洁易用的界面和强大的转换功能,PDF2MD成为内容创作者、研究人员和开发者的得力助手。☆177Oct 11, 2025Updated 4 months ago
- [NeurIPS'24] Protecting Your LLMs with Information Bottleneck☆26Nov 7, 2024Updated last year
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆96Nov 20, 2025Updated 3 months ago
- OpenAI compatible /chat/completions endpoint for fal.ai☆50Nov 27, 2025Updated 3 months ago
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆98Jun 4, 2025Updated 9 months ago
- 智能视频处理系统☆48Dec 26, 2024Updated last year
- This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includ…☆26Jan 9, 2026Updated 2 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- ☆26Jun 4, 2025Updated 9 months ago
- 将所有AI 产品接入你的微信,打造你个人AI 助理,帮助你解决更多生活日常。☆407Apr 25, 2025Updated 10 months ago