oztrkoguz / VisQueryPDFLinks
It automatically describes images in PDF files and generates questions from these descriptions. With its advanced RAG structure, it directs these questions directly to PDF text content, providing comprehensive information extraction and analysis.
☆12Updated last year
Alternatives and similar repositories for VisQueryPDF
Users that are interested in VisQueryPDF are comparing it to the libraries listed below
Sorting:
- This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.☆14Updated last year
- An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with …☆41Updated 3 months ago
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati…☆31Updated 7 months ago
- Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of …☆29Updated last year
- LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language☆19Updated 11 months ago
- This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize t…☆12Updated last year
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆73Updated 7 months ago
- Image identification with Kosmos2 model, drawing and cutting bbox with object detection☆16Updated last year
- ☆12Updated last year
- ☆22Updated last year
- 100% Local Document deep search with LLMs☆26Updated last year
- Wan2.1, quantized and optimized so it fits on your 3090/4090☆34Updated 8 months ago
- ☆25Updated last year
- ☆13Updated 10 months ago
- ☆12Updated 10 months ago
- built a 124M param GPT☆22Updated 9 months ago
- For loading and running Pixtral models☆77Updated 9 months ago
- This repo contains Lyra AI's work in the E-Commerce Hackathon organized by Trendyol and Teknofest.☆13Updated last year
- ☆16Updated last year
- Roboflow Workflows on ComfyUI☆33Updated last year
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset.☆20Updated last year
- ☆22Updated last year
- Gradio Demo for ComfyDeploy☆55Updated last year
- A custom node extension for ComfyUI that integrates Google's Veo 2 text-to-video generation capabilities.☆30Updated 6 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆15Updated 8 months ago
- Gen AI based travel assistant for Turkish Airlines customers☆11Updated last year
- A pipeline parallel training script for LLMs.☆161Updated 6 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆53Updated last year
- ☆22Updated last year
- Prompt-based Evolutionary Nudity Iteration System☆136Updated 3 months ago