oztrkoguz / VisQueryPDFLinks
It automatically describes images in PDF files and generates questions from these descriptions. With its advanced RAG structure, it directs these questions directly to PDF text content, providing comprehensive information extraction and analysis.
☆12Updated last year
Alternatives and similar repositories for VisQueryPDF
Users that are interested in VisQueryPDF are comparing it to the libraries listed below
Sorting:
- This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.☆14Updated last year
- An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with …☆46Updated 5 months ago
- Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of …☆29Updated 2 years ago
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset.☆20Updated last year
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆74Updated 9 months ago
- LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language☆20Updated last year
- ☆12Updated last year
- ☆23Updated last year
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati…☆32Updated 9 months ago
- Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Pro…☆550Updated last month
- 100% Local Document deep search with LLMs☆26Updated last year
- Deforum based on flux-dev by XLabs-AI☆224Updated last year
- This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize t…☆12Updated last year
- automatically quant GGUF models☆219Updated 2 weeks ago
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆102Updated last year
- ☆127Updated last year
- Image identification with Kosmos2 model, drawing and cutting bbox with object detection☆16Updated last year
- A pipeline parallel training script for LLMs.☆165Updated 8 months ago
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆28Updated 7 months ago
- Code for "A Comprehensive Analysis of Static Word Embeddings for Turkish". Expert Systems with Applications 2024.☆27Updated 11 months ago
- Gradio Demo for ComfyDeploy☆55Updated last year
- Prompt-based Evolutionary Nudity Iteration System☆139Updated 5 months ago
- A convenient fast Text to Speech Whisper Speech by Collabora you can train a voice on the fly on ComfyUI☆44Updated 10 months ago
- ☆11Updated last year
- ☆13Updated last year
- Faster Stable Diffusion using SSD-1B. A gradio app inside for demo.☆15Updated 2 years ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- Image captioning using python and BLIP☆50Updated 2 years ago
- This repo contains codes covered in the youtube tutorials.☆87Updated 7 months ago
- An example repository to use HuggingFace smolagents, Phidata and CrewAI frameworks with local LLMs☆39Updated last year