weareprestatech / hotpdf
hotpdf is a fast PDF parsing library, built on top of pdfminer.six, for extracting and finding text within PDF documents.
☆185Updated 3 months ago
Alternatives and similar repositories for hotpdf:
Users who are interested in hotpdf are comparing it to the libraries listed below.
- 90% of what you need for LLM app development. Nothing you don't.☆252Updated this week
- A Lightweight Library for LLM I/O☆114Updated 2 months ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux.☆169Updated 9 months ago
- Virtual environment stacks for Python☆237Updated last week
- Structured information extraction from documents☆312Updated 6 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆221Updated 3 months ago
- Extremely memory-efficient vector database☆66Updated 6 months ago
- Generate Python documentation using LLMs☆63Updated 9 months ago
- DocFlow is a powerful Document Management API designed to streamline document handling, including seamless uploading, downloading, organi…☆139Updated 3 months ago
- The LLM library for the Agent era.☆24Updated last week
- Query CSV, JSON and Parquet files with SQL☆108Updated 9 months ago
- Unattended Lightweight Text Classifiers with LLM Embeddings☆184Updated 6 months ago
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆78Updated 8 months ago
- Experiment and integrate with different OCR frameworks seamlessly☆104Updated 11 months ago
- ☆48Updated last year
- A prompting library☆155Updated 6 months ago
- Unbelievably fast async web framework, proudly written in Python, offering high-level development, low-level performance, multiplying 0.1x…☆87Updated this week
- Simplifies the process of creating and managing LLM workflows.☆99Updated 5 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆141Updated 11 months ago
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆179Updated 3 weeks ago
- GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.☆74Updated 2 weeks ago
- A Pythonic library providing a lightweight interface to LLMs☆125Updated 5 months ago
- ☆176Updated 2 weeks ago
- TF-ID: Table/Figure IDentifier for academic papers☆230Updated 8 months ago
- The easiest way to ship Python applications.☆197Updated 4 months ago
- A simple Python program to implement the search-extract-summarize flow.☆258Updated 2 months ago
- Pipeline for converting PDFs to raw text with PaddleOCR☆21Updated last year
- A Python library for verifying code properties using natural language assertions.☆31Updated last month
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆282Updated last week
- A simple tool that lets you explore different possible paths that an LLM might sample.☆134Updated last week