CyberCRI / refinedoc
Python library for extracting headers, footers and body from PDF
☆21 Updated last week
Alternatives and similar repositories for refinedoc
Users who are interested in refinedoc are comparing it to the libraries listed below.
- A better job search based on semantic matching ☆17 Updated last year
- Airbnb scraper made in Python ☆110 Updated this week
- Prompt markup language (A.K.A PromptML) library is specially built for AI systems - from Vidura AI ☆59 Updated 3 months ago
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr… ☆276 Updated 3 weeks ago
- Spider ported to Python ☆103 Updated 2 weeks ago
- Python Wrapper on top of Unofficial Medium API to quickly extract data from Medium's website. ☆61 Updated 6 months ago
- Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markd… ☆52 Updated 5 months ago
- Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. ☆74 Updated last year
- Web application that converts audio and video to text using AI, supporting various formats and self-hosting. ☆129 Updated 10 months ago
- AI assisted web scraping and data extraction ☆207 Updated this week
- Mixpost Installation with Docker Containers ☆13 Updated 2 years ago
- Open Source LinkedIn Scraper ☆122 Updated last month
- Code AI Fusion is created with objective of bringing the Generative Power AI and mix with the pure logic that is written with code. ☆24 Updated 5 months ago
- EmailGenius: AI-Driven Email Categorization ☆30 Updated 2 years ago
- automate chatgpt using selenium without api ☆75 Updated last year
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex. ☆52 Updated last year
- A Python asyncio wrapper for Tesseract-OCR. ☆27 Updated 3 weeks ago
- REST API for Large Language Models using FastAPI, Redis and LiteLLM