weareprestatech / hotpdf
hotpdf is a fast PDF parsing library, built on top of pdfminer.six, for extracting text from PDF documents and finding text within them.
☆178 Updated 7 months ago
Related projects
Alternatives and complementary repositories for hotpdf
- A Lightweight Library for LLM I/O ☆105 Updated 2 weeks ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux. ☆166 Updated 5 months ago
- ☆48 Updated this week
- GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt. ☆61 Updated 8 months ago
- Virtual environment stacks for Python ☆156 Updated this week
- The easiest way to ship python applications. ☆193 Updated 4 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages. ☆133 Updated this week
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex … ☆71 Updated 4 months ago
- A prompting library ☆128 Updated last month
- Generate ideal question-answers for testing RAG ☆123 Updated 4 months ago
- simplifies the process of creating and managing LLM workflows. ☆84 Updated last month
- Generate python documentation using LLMs ☆56 Updated 4 months ago
- Extract structured text from pdfs quickly ☆342 Updated this week
- Unattended Lightweight Text Classifiers with LLM Embeddings ☆174 Updated 2 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆269 Updated 2 months ago
- A full-stack framework that integrates FastAPI and React. ☆139 Updated 7 months ago
- Query CSV, JSON and Parquet files with SQL ☆103 Updated 5 months ago
- Dabbling with ReAct chatbots ☆164 Updated 3 months ago
- Action library for AI Agent ☆191 Updated 2 weeks ago
- A Python library for programmatically generating Draw.io charts. ☆235 Updated last month
- A pythonic library providing light-weighted interface with LLMs ☆119 Updated 2 weeks ago
- Pipeline for converting PDFs to raw text with PaddleOCR ☆20 Updated last year
- clean & curate your data with LLMs. ☆470 Updated 4 months ago
- A zero-setup, easy to use document store for Python ☆69 Updated last month
- A fast and lightweight pure Python library for splitting text into semantically meaningful chunks. ☆182 Updated 4 months ago
- a lightweight, comprehensive solution for managing delta tables built on polars and deltalake ☆108 Updated last week
- Structured information extraction from documents ☆282 Updated last month
- ☆260 Updated 7 months ago
- ☆162 Updated 3 weeks ago