PDFStract - Extract, Chunking and Embedding Layer in Your RAG Pipeline - Available as CLI - WEBUI - API
☆142Mar 18, 2026Updated last month
Alternatives and similar repositories for pdfstract
Users that are interested in pdfstract are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi…☆25Jun 7, 2025Updated 10 months ago
- Find your files with natural language and ask questions.☆58Mar 21, 2026Updated 3 weeks ago
- Efficient MCP tool calling in code mode for Claude Code☆23Dec 12, 2025Updated 4 months ago
- Reverse-engineered Perplexity API client in Python. Facilitates WebSocket communication for real-time AI responses, maintaining session i…☆26May 9, 2024Updated last year
- Connect and dynamically manage multiple MCP servers/tools through a single SSE interface, allowing your AI agent or AI APP to control MCP…☆17May 22, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs, RAG pipelines, and beyond.☆69Updated this week
- An open source real-time AI inference engine for seamless scaling☆22Jul 2, 2025Updated 9 months ago
- ☆24Apr 4, 2025Updated last year
- A collection of hardware Trojans (HTs) automatically generated by Large Language Models (GPT-4, Gemini-1.5-pro, and LLaMA3) targeting SRA…☆11Oct 8, 2025Updated 6 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆104Nov 25, 2024Updated last year
- Mapping and monitoring of infrastructure in desert regions with Sentinel-1☆15Feb 9, 2026Updated 2 months ago
- PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation☆2,811Updated this week
- 🐼🌹 A simple Pandas accessor for making windrose plots.☆15Mar 20, 2026Updated 3 weeks ago
- Python script that can be used to generate latitude/longitude coordinates for GOES-16 full-disk extent.☆10Jan 26, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- pysat support for space weather indices and data sets☆14Apr 3, 2026Updated 2 weeks ago
- Known Electromagnetic Radiation Mapping and Identification Toolkit (KERMIT) - Map where RF signal is strongest☆14May 17, 2025Updated 11 months ago
- screenshot OCR server☆17Mar 25, 2026Updated 3 weeks ago
- A powerful, yet simple to use, self-hosted redirect service☆40Mar 27, 2026Updated 3 weeks ago
- A blazingly fast microservice for matching ROM file hashes and caching game metadata. Originally designed for RetroRealm.☆23Jul 28, 2025Updated 8 months ago
- ☆27Sep 11, 2025Updated 7 months ago
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w…☆83Aug 16, 2025Updated 8 months ago
- Topographic map retrieval from 3DEP☆18Apr 13, 2026Updated last week
- Custom launcher for Claude Code, supporting dynamic prompts, layered configuration and easy custom hooks and MCPs.☆16Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 6 months ago
- ☆20Sep 6, 2025Updated 7 months ago
- Class for reading NEXRAD Level 3 files in Python☆13Mar 29, 2015Updated 11 years ago
- ☆16Jun 27, 2025Updated 9 months ago
- pdfLLM is a completely open source, proof of concept RAG app.☆186Sep 1, 2025Updated 7 months ago
- Tutorial for accessing ERA5 data on AWS for use in running the Weather Research and Forecasting (WRF) model☆14May 23, 2024Updated last year
- An open-source, self-hosted app to track your CD and vinyl collection☆41Apr 12, 2026Updated last week
- ☆30Oct 4, 2024Updated last year
- ☆14Apr 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Chrome extension that provides comprehensive browser fingerprint protection by defending against various tracking techniques used across …☆24Oct 26, 2025Updated 5 months ago
- This library aims to simplify any process working with different sets of EO data handled by EOReader☆13Apr 7, 2026Updated last week
- Open-source and extensible radar mosaic creation in Python☆17Dec 20, 2022Updated 3 years ago
- Docker container to tag your music with MusicBrainz Picard, right from your browser.☆42Apr 11, 2026Updated last week
- This Python script automates the retrieval and visualization of tropospheric NO2 data from Sentinel-5P satellite's TROPOMI instrument, en…☆12Jun 18, 2025Updated 10 months ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- Tools for geospatial analysis of gridded and ungridded lightning fields☆12Mar 8, 2017Updated 9 years ago