oztrkoguz / VisQueryPDFLinks
It automatically describes images in PDF files and generates questions from these descriptions. With its advanced RAG structure, it directs these questions directly to PDF text content, providing comprehensive information extraction and analysis.
☆12Updated last year
Alternatives and similar repositories for VisQueryPDF
Users that are interested in VisQueryPDF are comparing it to the libraries listed below
Sorting:
- This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.☆14Updated last year
- An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with …☆42Updated 3 months ago
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati…☆31Updated 7 months ago
- This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize t…☆12Updated last year
- ☆22Updated last year
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆74Updated 7 months ago
- Gradio Demo for ComfyDeploy☆55Updated last year
- For loading and running Pixtral models☆78Updated 10 months ago
- Image identification with Kosmos2 model, drawing and cutting bbox with object detection☆16Updated last year
- Faster Stable Diffusion using SSD-1B. A gradio app inside for demo.☆15Updated 2 years ago
- Cosmos1GP for the GPU Poor by DeepBeepMeep☆81Updated 9 months ago
- Deforum based on flux-dev by XLabs-AI☆223Updated last year
- An AI focused photo manipulation tool based on Gradio☆183Updated 5 months ago
- Image captioning using python and BLIP☆50Updated 2 years ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆136Updated last year
- Custom nodes for using fal API.☆163Updated last week
- ☆30Updated this week
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆102Updated last year
- Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of …☆29Updated 2 years ago
- ☆28Updated last year
- Roboflow Workflows on ComfyUI☆33Updated last year
- 100% Local Document deep search with LLMs☆26Updated last year
- Run Local and API LLMs, Features Gemini2 image generation, DEEPSEEK R1, QwenVL2.5, QWQ32B, Ollama, LlamaCPP LMstudio, Koboldcpp, TextGen,…☆145Updated 7 months ago
- gradio web ui for musepose☆15Updated last year
- NNT Neural Network Toolkit Custom Nodes for ComfyUI☆67Updated 10 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆53Updated last year
- Unofficial implementation of PhotoMakerV2 for ComfyUI☆18Updated last year
- AI-powered video frame extraction tool that automatically identifies and extracts high-quality frames containing people, with intelligent…☆146Updated 5 months ago
- ☆25Updated last year
- ☆31Updated 7 months ago