pdfLLM is a completely open source, proof of concept RAG app.
☆187Sep 1, 2025Updated 6 months ago
Alternatives and similar repositories for pdfLLM
Users that are interested in pdfLLM are comparing it to the libraries listed below
Sorting:
- A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.☆46Jan 26, 2026Updated last month
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi…☆25Jun 7, 2025Updated 9 months ago
- Find audiobooks missing from a series you own. This works for Audible series only.☆23Feb 2, 2026Updated last month
- Retrieval-augmented generation (RAG) for remote & local LLM use☆44May 24, 2025Updated 9 months ago
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs, RAG pipelines, and beyond.☆65Mar 13, 2026Updated last week
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.☆26Aug 18, 2025Updated 7 months ago
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆82Dec 16, 2025Updated 3 months ago
- Datu Core AI Analyst open-source☆42Sep 14, 2025Updated 6 months ago
- Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and Windsurf integrations with battle…☆52Nov 25, 2025Updated 3 months ago
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w…☆80Aug 16, 2025Updated 7 months ago
- GrantFlow.ai is a platform for creating grant applications using ML and AI☆53Updated this week
- Kick is an AI-powered assistant that provides voice and keyboard control over your Windows device, enabling seamless automation of your d…☆16Jul 29, 2025Updated 7 months ago
- A project containing a Cloudflare Worker for secure email forwarding and sanitization. It integrates email-alias-core and email-scrubber-…☆27Updated this week
- My attempt at implementing retreival augmented generation on Ollama and other LLM services using chromadb and langchain while also provid…☆50Oct 12, 2025Updated 5 months ago
- ETL project to download and process both CME open interest data, COT data from the CFTC and NAV/shares-outstanding data from various ETF …☆12Jul 13, 2021Updated 4 years ago
- hentai game manager, mostly for f95, but might add other sites later☆11Nov 4, 2025Updated 4 months ago
- Prometheus Exporter for Ollama☆35Jul 23, 2025Updated 7 months ago
- The most accurate document search and store for building AI apps☆3,541Feb 25, 2026Updated 3 weeks ago
- Turning 9-to-5ers into Algo-Traders. Official code from Instagram @quant.traderr☆32Mar 15, 2026Updated last week
- Extract data from websites in LLM ready JSON or CSV format. Crawl or Scrape entire website with Website Crawler☆74Feb 19, 2026Updated last month
- Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.☆26Sep 25, 2025Updated 5 months ago
- ☆10Nov 30, 2024Updated last year
- Your AI mate into your favourite terminal☆136Mar 7, 2026Updated 2 weeks ago
- ComfyUI-Downloader☆33Jan 19, 2026Updated 2 months ago
- A microservice for real-time TradingView OHLCV data streaming via WebSocket, with dynamic subscriptions, Prometheus metrics, and proxy s…☆39Jul 21, 2025Updated 8 months ago
- Authentication portal that gives Plex, Jellyfin, and Emby users secure single sign-on to internal services. v2.0.4 adds a modular admin c…☆90Updated this week
- This is a training method to produce a split brain model☆14Mar 7, 2025Updated last year
- A lightweight Docker and LXC container manager with group automation, start-on-access, and real-time status monitoring.☆93Dec 27, 2025Updated 2 months ago
- ☆24Apr 4, 2025Updated 11 months ago
- Run and monitor MCP servers locally☆211Aug 16, 2025Updated 7 months ago
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Nov 5, 2025Updated 4 months ago
- The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, …☆1,374Updated this week
- ☆1,290Updated this week
- PVMSS is a lightweight, self-service web portal for Proxmox Virtual Environment. It allows users to create and manage virtual machines wi…☆39Updated this week
- PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation☆2,717Updated this week
- ☆65Jan 7, 2026Updated 2 months ago
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆498Updated this week
- Mirrored from https://codeberg.org/kramo/CommitEdit☆35Oct 12, 2025Updated 5 months ago
- WebForms is a versatile tool for creating HTML UI forms with backend support, offering flexibility, security, and integration capabilitie…☆12Nov 2, 2023Updated 2 years ago