pdf multimodal rag 【pdf多模态rag问答】
☆27Feb 26, 2025Updated last year
Alternatives and similar repositories for pdf_multimodal_rag
Users that are interested in pdf_multimodal_rag are comparing it to the libraries listed below
Sorting:
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆31Feb 10, 2026Updated 3 weeks ago
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models☆10Dec 21, 2025Updated 2 months ago
- 一个基于FastAPI和React的智能体系统,支持多智能体管理、mcp管理、知识库、聊天对话等功能。An intelligent agent system based on FastAPI and React, supporting multi-agent managem…☆21Jan 25, 2026Updated last month
- 本项目主要介绍prompt工程相关用例。包括模拟智能推荐客服系统构建和问答、思维链、自洽性、思维树等相关进阶demo,旨在帮助大家理解prompt。通过一份代码实现了同时支持多种大模型(如OpenAI、阿里通义千问等)并使用FastAPI对应用进行API封装。☆52Sep 26, 2024Updated last year
- ☆10Jun 28, 2023Updated 2 years ago
- TABLE DETECTION IN IMAGES AND OCR TO CSV WITH JAVA☆10Jul 18, 2023Updated 2 years ago
- ☆11Oct 31, 2024Updated last year
- ppt转数字人后台☆18Apr 9, 2025Updated 11 months ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- ☆11Mar 22, 2024Updated last year
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated 11 months ago
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆13Nov 15, 2022Updated 3 years ago
- Eagle and EagleSim: Deep-RL for PTZ Cameras☆10Aug 23, 2024Updated last year
- Crawl traffic data from PEMS☆10Jul 19, 2021Updated 4 years ago
- ☆14Feb 9, 2026Updated last month
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆23May 2, 2025Updated 10 months ago
- 毕业设计:互联网新闻热点抽取系统☆10May 21, 2022Updated 3 years ago
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆13Sep 2, 2024Updated last year
- Robust and Memory Efficient Event Detection and Tracking in Large News Feeds☆13Oct 15, 2021Updated 4 years ago
- 基于新浪微博的面向食品安全的舆情话题检测与追踪系统☆13Jul 6, 2022Updated 3 years ago
- 一个开源的多模态 AI 搜索项目,结合 大语言模型(LLM)+ 多源搜索引擎 + 多 Agent 架构,打造新一代的智能问答式搜索体验☆13Mar 26, 2025Updated 11 months ago
- Implementation of a histogram equalization program using CUDA. Histogram equalization is a technique for adjusting image intensities to e…☆13Jan 3, 2021Updated 5 years ago
- cutile kernel examples☆39Feb 6, 2026Updated last month
- Generate xml documentaton comment stubs for c++ when three forward slashes are typed☆13Mar 21, 2018Updated 7 years ago
- TensorRT half precision inference routine on a API-based TensorRT model☆13Jul 3, 2018Updated 7 years ago
- The code for LexDrafter framework: a framework that assists in drafting Definitions articles for legislative documents using retrieval au…☆13May 12, 2025Updated 9 months ago
- 生成中文文字识别(OCR)的训练数据☆12Mar 2, 2020Updated 6 years ago
- Ip/Web camera stream viewer and recorder with QT☆12Apr 20, 2020Updated 5 years ago
- paper-read-notes☆12Sep 26, 2024Updated last year
- 基于ChatGLM3-6b的智能对话系统,集成了RAG、知识图谱、Agent、多模态等技术来增强大模型的回复质量。☆63Aug 12, 2024Updated last year
- Underwater Object Detection Kesci大赛项目:全国水下机器人大赛 - 水下目标检测算法赛☆13Apr 13, 2020Updated 5 years ago
- Explore cutting-edge Redis capabilities for Vector Similarity Search, Hybrid Search (Vector Similarity + Meta Search), Semantic Caching, …☆16Jan 21, 2024Updated 2 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆13Aug 11, 2020Updated 5 years ago
- 利用llm大语言模型提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from …☆16Jul 22, 2024Updated last year
- ☆17Dec 1, 2023Updated 2 years ago
- ☆12Oct 23, 2021Updated 4 years ago
- ☆14Nov 13, 2023Updated 2 years ago
- ☆12Nov 27, 2019Updated 6 years ago