jiangnanboy / pdf_multimodal_ragView external linksLinks
pdf multimodal rag 【pdf多模态rag问答】
☆25Feb 26, 2025Updated 11 months ago
Alternatives and similar repositories for pdf_multimodal_rag
Users that are interested in pdf_multimodal_rag are comparing it to the libraries listed below
Sorting:
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆30Jan 31, 2026Updated 2 weeks ago
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models☆10Dec 21, 2025Updated last month
- ☆11Oct 31, 2024Updated last year
- 增加了indextts2的简单的界面与api调用方式☆20Oct 27, 2025Updated 3 months ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- ☆14Feb 5, 2026Updated last week
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆23May 2, 2025Updated 9 months ago
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Jul 25, 2022Updated 3 years ago
- Eagle and EagleSim: Deep-RL for PTZ Cameras☆10Aug 23, 2024Updated last year
- ☆11Mar 22, 2024Updated last year
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- 一个开源的多模态 AI 搜索项目,结合 大语言模型(LLM)+ 多源搜索引擎 + 多 Agent 架构,打造新一代的智 能问答式搜索体验☆13Mar 26, 2025Updated 10 months ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated last year
- Robust and Memory Efficient Event Detection and Tracking in Large News Feeds☆13Oct 15, 2021Updated 4 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆13Sep 2, 2024Updated last year
- Flash Attention in ~100 lines of CUDA (forward pass only)☆11Jun 10, 2024Updated last year
- textcnn for advertising detection,广告检测☆11Jan 12, 2024Updated 2 years ago
- ☆11Jan 2, 2022Updated 4 years ago
- Unofficial implementation of Variational Diffusion Models in PyTorch (Lightning)☆11Aug 31, 2023Updated 2 years ago
- Implementation of a histogram equalization program using CUDA. Histogram equalization is a technique for adjusting image intensities to e…☆13Jan 3, 2021Updated 5 years ago
- Generate xml documentaton comment stubs for c++ when three forward slashes are typed☆13Mar 21, 2018Updated 7 years ago
- Ip/Web camera stream viewer and recorder with QT☆12Apr 20, 2020Updated 5 years ago
- The code for LexDrafter framework: a framework that assists in drafting Definitions articles for legislative documents using retrieval au…☆13May 12, 2025Updated 9 months ago
- TensorRT half precision inference routine on a API-based TensorRT model☆13Jul 3, 2018Updated 7 years ago
- 深度网络实现意图分类。☆11Feb 26, 2021Updated 4 years ago
- paper-read-notes☆13Sep 26, 2024Updated last year
- 生成中文文字识别(OCR)的训练数据☆12Mar 2, 2020Updated 5 years ago
- Underwater Object Detection Kesci大赛项目:全国水下机器人大赛 - 水下目标检测算法赛☆13Apr 13, 2020Updated 5 years ago
- ☆14Nov 13, 2023Updated 2 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆13Aug 11, 2020Updated 5 years ago
- Official implementation of Character Region Awareness for Text Detection (CRAFT)☆15Jan 14, 2025Updated last year
- ☆14Jul 20, 2020Updated 5 years ago
- ☆12Nov 27, 2019Updated 6 years ago
- 自动生成短视频,文章自动成片,多模态混剪,数字人,声音克隆☆13Jun 25, 2024Updated last year
- 利用llm大语言模型提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from …☆16Jul 22, 2024Updated last year
- t5-model-onnx,中文拼写纠错,Chinese spelling correction。☆15Sep 18, 2022Updated 3 years ago
- ☆17Dec 1, 2023Updated 2 years ago
- A fully functional knowledge base platform offering robust content management, AI-powered intelligent Q&A, and a modern user experience 一…☆24Updated this week
- ☆13Dec 28, 2021Updated 4 years ago