pdfLLM is a completely open source, proof of concept RAG app.
☆187Sep 1, 2025Updated 10 months ago
Alternatives and similar repositories for pdfLLM
Users that are interested in pdfLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.☆51Jan 26, 2026Updated 5 months ago
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi…☆27Jun 7, 2025Updated last year
- Find audiobooks missing from a series you own. This works for Audible series only.☆45Jun 15, 2026Updated 2 weeks ago
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs, RAG pipelines, and beyond.☆79Jun 22, 2026Updated last week
- LLM playground to experiment with local models and build fine-tuning datasets and benchmarks☆52Mar 11, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.☆26Aug 18, 2025Updated 10 months ago
- ☆54May 11, 2026Updated last month
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆84Dec 16, 2025Updated 6 months ago
- Datu Core AI Analyst open-source☆42Sep 14, 2025Updated 9 months ago
- Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and Windsurf integrations with battle…☆56Apr 23, 2026Updated 2 months ago
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w…☆86Aug 16, 2025Updated 10 months ago
- GrantFlow.ai is a platform for creating grant applications using ML and AI☆57Mar 20, 2026Updated 3 months ago
- We believe that every SOTA result is only valid on its own dataset. RAGView provides a unified evaluation platform to benchmark different…☆79Dec 5, 2025Updated 6 months ago
- ☆37Jul 10, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This workshop covers the entire process of using Milvus—from installation and basic concepts to core operations and practical application…☆34Jun 4, 2026Updated 3 weeks ago
- Bot that monitors desirable skins and will automatically buy them☆21May 15, 2023Updated 3 years ago
- RLVR Testing and Training☆22Aug 28, 2025Updated 10 months ago
- MyLinks shows bookmarks organised in widgets☆10Jun 22, 2025Updated last year
- ☆13Feb 3, 2025Updated last year
- hentai game manager, mostly for f95, but might add other sites later☆13Jun 16, 2026Updated 2 weeks ago
- My attempt at implementing retreival augmented generation on Ollama and other LLM services using chromadb and langchain while also provid…☆52Oct 12, 2025Updated 8 months ago
- Find your files with natural language and ask questions.☆60May 27, 2026Updated last month
- A text-grid web renderer for AI agents — see the web without screenshots☆100Mar 10, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- MCP -> CLI + SKILLs. Maintain SKILL pararllel with MCP Server.☆62Apr 13, 2026Updated 2 months ago
- The most accurate document search and store for building AI apps☆3,621Jun 19, 2026Updated last week
- Prometheus Exporter for Ollama☆45Jul 23, 2025Updated 11 months ago
- Extract data from websites in LLM ready JSON or CSV format. Crawl or Scrape entire website with Website Crawler☆74Feb 19, 2026Updated 4 months ago
- Self-hosted video downloader with metadata collection built on Python☆19Mar 11, 2025Updated last year
- Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.☆29Sep 25, 2025Updated 9 months ago
- PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation☆3,002Updated this week
- Your AI mate into your favourite terminal☆149May 9, 2026Updated last month
- A microservice for real-time TradingView OHLCV data streaming via WebSocket, with dynamic subscriptions, Prometheus metrics, and proxy s…☆46Jul 21, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is a training method to produce a split brain model☆14Mar 7, 2025Updated last year
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Nov 5, 2025Updated 7 months ago
- A lightweight Docker and LXC container manager with group automation, start-on-access, and real-time status monitoring.☆93Dec 27, 2025Updated 6 months ago
- Secure SSO portal for Plex, Jellyfin, and Emby users to access internal services, with RBAC, built-in LDAP sync, OAuth/OIDC, audit logs, …☆97Jun 22, 2026Updated last week
- ☆1,420Jun 23, 2026Updated last week
- WebForms is a versatile tool for creating HTML UI forms with backend support, offering flexibility, security, and integration capabilitie…☆12Nov 2, 2023Updated 2 years ago
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆540Jun 24, 2026Updated last week