It automatically describes images in PDF files and generates questions from these descriptions. With its advanced RAG structure, it directs these questions directly to PDF text content, providing comprehensive information extraction and analysis.
☆12Jun 29, 2024Updated last year
Alternatives and similar repositories for VisQueryPDF
Users that are interested in VisQueryPDF are comparing it to the libraries listed below
Sorting:
- This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize t…☆12Jun 3, 2024Updated last year
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati…☆32Apr 7, 2025Updated 11 months ago
- This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.☆14Jul 28, 2024Updated last year
- Image identification with Kosmos2 model, drawing and cutting bbox with object detection☆16Jul 25, 2024Updated last year
- My ComfyUI workflows collection☆15Jan 18, 2025Updated last year
- An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with …☆47Aug 2, 2025Updated 7 months ago
- https://emre.xyz/kendi-blogunu-kendin-kodla☆11May 18, 2022Updated 3 years ago
- Covid-19 aşı ile ilgili doğru bilginin yayılımı için açılmış bir web uygulamasıdır.☆14Apr 29, 2022Updated 3 years ago
- Image Upscaler with Tile Controlnet Fully Integrated in Huggingface Diffusers☆20Jan 11, 2026Updated 2 months ago
- ☆20Aug 7, 2024Updated last year
- Passive DNS Dataset of Domain Resolutions☆18Jun 14, 2022Updated 3 years ago
- Analyze a real-time IPv4 packet stream and export metrics about the data flows☆14Jan 29, 2020Updated 6 years ago
- Go hakkındaki edindiğim bilgileri temel ve anlaşılabilir bir düzeyde anlatmaya çalıştım. Eksik veya hatalı gördüğünüz kısımları belirtebi…☆17Oct 28, 2020Updated 5 years ago
- IOCTL-Flooder is a verbose tool designed to help with Windows driver fuzzing by brute forcing IOCTLs on loaded drivers. GetLastError is u…☆11Aug 21, 2018Updated 7 years ago
- SAGA: Spectral Adversarial Geometric Attack on 3D Meshes (ICCV 2023)☆25Sep 25, 2023Updated 2 years ago
- Package for word stress detection☆11Jan 27, 2023Updated 3 years ago
- Vulnerability Knowledge Base comparison tool☆13Feb 9, 2022Updated 4 years ago
- A wrapper script for https://sploitus.com to scrape query results for tools and exploits☆14Mar 3, 2019Updated 7 years ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated last year
- Excel MCP Server - Manipulate Excel files without Microsoft Excel. Model Context Protocol for XLSX, XLSM with Claude AI integration☆25Jun 18, 2025Updated 9 months ago
- A graphical sound editor which uses CSCore library for reading and playing sound files.☆11Jul 27, 2016Updated 9 years ago
- notes on applied computer security☆12Jun 27, 2023Updated 2 years ago
- Windows Forms controls for audio spectrum visualization, etc. using NAudio.☆10Feb 15, 2021Updated 5 years ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- a collection of tools and python libraries for use in medical practice and research☆11Nov 22, 2025Updated 3 months ago
- ✓ Using scapy and nmap tools, find out ip/port, ARP SPOOFING & TCP SESSION HIJACKING ✓ Using snort tool provided Intrusion Detection Syst…☆10Oct 29, 2019Updated 6 years ago
- C++ junior developer☆12Feb 15, 2026Updated last month
- Bu dizin Youtube eğitim serisi için yaratılmıştır.☆34Sep 17, 2020Updated 5 years ago
- Geosearch photo in vk.com☆13Mar 19, 2020Updated 6 years ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- Forecasting of forest drought impacts in Switzerland from satellite imagery, weather reanalysis and remote sensing data. Pixel-wise forec…☆15Sep 14, 2023Updated 2 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Oct 27, 2017Updated 8 years ago
- The classical snake game☆15Feb 23, 2023Updated 3 years ago
- Building reliable Retrieval Augmented Generation(RAG) AI Architecture☆13Jul 30, 2024Updated last year
- As society and technology develop, more and more of our time is spent online, from shopping to socialising, working to banking. Ensuring …☆10Oct 8, 2022Updated 3 years ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23May 6, 2025Updated 10 months ago
- ☆19Jan 19, 2026Updated 2 months ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- Official implementation of "CONCRETE: Improving Cross-lingual Fact Checking with Cross-lingual Retrieval" (COLING'22)☆16Oct 13, 2022Updated 3 years ago