oztrkoguz / VisQueryPDF
It automatically describes the images in PDF files and generates questions from those descriptions. Its RAG pipeline then runs these questions against the PDF's text content to extract and analyze the relevant information.
☆12 Updated last year
Alternatives and similar repositories for VisQueryPDF
Users who are interested in VisQueryPDF are comparing it to the libraries listed below.
- This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance. ☆14 Updated last year
- An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with … ☆36 Updated last week
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy… ☆71 Updated 4 months ago
- This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize t… ☆13 Updated last year
- LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language ☆18 Updated 8 months ago
- VoiceHub: A Unified Inference Interface for TTS Models ☆46 Updated last week
- Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of … ☆29 Updated last year
- ☆12 Updated last year
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati… ☆30 Updated 4 months ago
- ☆21 Updated last year
- An example repository to use HuggingFace smolagents, Phidata and CrewAI frameworks with local LLMs ☆38 Updated 7 months ago
- An AI focused photo manipulation tool based on Gradio ☆185 Updated last month
- A pipeline parallel training script for LLMs. ☆153 Updated 3 months ago
- Image identification with Kosmos2 model, drawing and cutting bbox with object detection ☆16 Updated last year
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images. ☆60 Updated 8 months ago
- ☆12 Updated 7 months ago
- Gradio Demo for ComfyDeploy ☆54 Updated last year
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models. ☆99 Updated last year
- ☆40 Updated last year
- Deforum based on flux-dev by XLabs-AI ☆222 Updated 11 months ago
- Faster Stable Diffusion using SSD-1B. A gradio app inside for demo. ☆15 Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D ☆137 Updated 10 months ago
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset. ☆19 Updated 11 months ago
- Enhance faces in AI generated images ☆47 Updated last month
- Custom nodes for using fal API. ☆140 Updated 3 weeks ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type. ☆68 Updated 9 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆22 Updated 3 months ago
- ☆22 Updated 9 months ago
- ☆26 Updated 4 months ago
- A Python toolkit for image clustering using deep learning, PCA, and K-means, with support for GPU and CPU processing. Simplify your image… ☆37 Updated last year