oztrkoguz / VisQueryPDFLinks
It automatically describes images in PDF files and generates questions from these descriptions. With its advanced RAG structure, it directs these questions directly to PDF text content, providing comprehensive information extraction and analysis.
☆12Updated last year
Alternatives and similar repositories for VisQueryPDF
Users that are interested in VisQueryPDF are comparing it to the libraries listed below
Sorting:
- This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.☆14Updated last year
- This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize t…☆12Updated last year
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset.☆22Updated last year
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati…☆32Updated 10 months ago
- An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with …☆46Updated 6 months ago
- Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of …☆29Updated 2 years ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆76Updated 10 months ago
- VoiceHub: A Unified Inference Interface for TTS Models☆62Updated last month
- Image identification with Kosmos2 model, drawing and cutting bbox with object detection☆16Updated last year
- 100% Local Document deep search with LLMs☆26Updated last year
- Faster Stable Diffusion using SSD-1B. A gradio app inside for demo.☆15Updated 2 years ago
- Run Ollama LLM models in Google Colab for free☆37Updated last year
- ☆12Updated last year
- ☆23Updated last year
- This repo contains Lyra AI's work in the E-Commerce Hackathon organized by Trendyol and Teknofest.☆13Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆105Updated last year
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆54Updated last year
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆102Updated last year
- ☆40Updated last year
- This repo contains codes covered in the youtube tutorials.☆87Updated 8 months ago
- multilingual RAG☆16Updated 2 years ago
- ☆12Updated last year
- Gradio Demo for ComfyDeploy☆55Updated last year
- Roboflow Workflows on ComfyUI☆33Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆108Updated last month
- ☆29Updated 2 years ago
- ☆25Updated 2 years ago
- Unofficial implementation of PhotoMakerV2 for ComfyUI☆19Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23Updated 9 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆16Updated 11 months ago