PDFStract - Extract, Chunking and Embedding Layer in Your RAG Pipeline - Available as CLI - WEBUI - API
☆133 · Mar 18, 2026 · Updated last week
Alternatives and similar repositories for pdfstract
Users that are interested in pdfstract are comparing it to the libraries listed below.
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi… ☆25 · Jun 7, 2025 · Updated 9 months ago
- In-browser semantic search demo using EmbeddingGemma and Transformers.js. No server required. ☆32 · Sep 7, 2025 · Updated 6 months ago
- Reverse-engineered Perplexity API client in Python. Facilitates WebSocket communication for real-time AI responses, maintaining session i… ☆27 · May 9, 2024 · Updated last year
- Simple, lightweight CRM application built on the Power Platform ☆11 · Jun 5, 2020 · Updated 5 years ago
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder, built for LLMs, RAG pipelines, and beyond. ☆65 · Mar 13, 2026 · Updated 2 weeks ago
- House of the Paiperwork ☆34 · Updated this week
- An open source real-time AI inference engine for seamless scaling ☆22 · Jul 2, 2025 · Updated 8 months ago
- ☆24 · Apr 4, 2025 · Updated 11 months ago
- Dimple is a cross-platform, open source, local first, private music player. ☆19 · Nov 1, 2025 · Updated 4 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis ☆103 · Nov 25, 2024 · Updated last year
- Layout and custom fields for DanePubliczne.gov.pl ☆10 · Dec 7, 2022 · Updated 3 years ago
- DCAT-AP-DK is a Danish application profile for describing datasets and data catalogs ☆10 · Feb 17, 2026 · Updated last month
- 🌪️ AI research assistant that generates Wikipedia-quality articles through multi-perspective analysis. Based on Stanford's STORM methodo… ☆54 · Jun 6, 2025 · Updated 9 months ago
- A Dockerfile linter using Hadolint for GitHub Actions that provides code annotations, GitHub Advanced Security integration and more ☆14 · Feb 8, 2026 · Updated last month
- PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation ☆2,738 · Updated this week
- 🐼🌹 A simple Pandas accessor for making windrose plots. ☆15 · Mar 20, 2026 · Updated last week
- Python script that can be used to generate latitude/longitude coordinates for GOES-16 full-disk extent. ☆10 · Jan 26, 2022 · Updated 4 years ago
- PublicaMundi main CKAN extension ☆13 · Jul 30, 2024 · Updated last year
- pysat support for space weather indices and data sets ☆14 · Oct 21, 2025 · Updated 5 months ago
- screenshot OCR server ☆17 · Updated this week
- A powerful, yet simple to use, self-hosted redirect service ☆40 · Updated this week
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w… ☆80 · Aug 16, 2025 · Updated 7 months ago
- Custom launcher for Claude Code, supporting dynamic prompts, layered configuration and easy custom hooks and MCPs. ☆16 · Updated this week
- DE (translated): This repository contains the extension of ckanext-dcat to the DCAT-AP.de specification. EN: This is a DCAT-AP.de specific CKA… ☆11 · Updated this week
- ☆20 · Sep 6, 2025 · Updated 6 months ago
- ☆22 · Mar 9, 2026 · Updated 3 weeks ago
- SHACL shapes for DCAT-AP.de ☆13 · Mar 2, 2026 · Updated 3 weeks ago
- ☆13 · Jan 15, 2017 · Updated 9 years ago
- Sophos UTM 9 REST API Client in Golang ☆12 · May 6, 2022 · Updated 3 years ago
- Class for reading NEXRAD Level 3 files in Python ☆12 · Mar 29, 2015 · Updated 11 years ago
- ☆16 · Jun 27, 2025 · Updated 9 months ago
- a golang implementation of gauntlt ☆12 · Oct 25, 2018 · Updated 7 years ago
- pdfLLM is a completely open source, proof of concept RAG app. ☆187 · Sep 1, 2025 · Updated 6 months ago
- Tutorial for accessing ERA5 data on AWS for use in running the Weather Research and Forecasting (WRF) model ☆14 · May 23, 2024 · Updated last year
- ☆30 · Oct 4, 2024 · Updated last year
- Symfony bundle that wraps the sapient library ☆11 · Jun 19, 2022 · Updated 3 years ago
- Seamlessly bridge Lidarr and YouTube. Automatically fetch missing albums, download them as fully tagged MP3s, and trigger instant Lidarr … ☆24 · Updated this week
- Cursor IDE like Pro ☆17 · Apr 29, 2025 · Updated 11 months ago
- Zvuk (Звук) grabber written in Go. This tool allows you to download artists, albums, tracks, and playlists from Zvuk. ☆17 · Mar 18, 2026 · Updated last week