soham-1 / fastapi_pdfextractorLinks
An api using fastapi for extracting the text content of pdf using pdfminer. It also supports scanned images in pdf's by using tesseract and ocrmypdf.
☆15Updated 4 years ago
Alternatives and similar repositories for fastapi_pdfextractor
Users that are interested in fastapi_pdfextractor are comparing it to the libraries listed below
Sorting:
- The faststream-gen library uses advanced AI to generate FastStream code from user descriptions, speeding up FastStream app development.☆49Updated last year
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated last year
- Self-host llmapi server, make it really easy for accessing LLMs !☆37Updated 2 years ago
- Redis Queue Dashboard based on FastAPI☆104Updated 5 months ago
- A Low-Code, Rapid Application Development (RAD) tool to assist with structure and building full stack applications☆19Updated last year
- Multi-Channel Message Relay with AI Customer Service 🚀 A FastAPI-powered messaging system for Email ✉️, SMS 📲, and Voice 🎙️, featuring…☆10Updated this week
- Python Wrapper on top of Unofficial Medium API to quickly extract data from Medium's website.☆58Updated 2 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- Library that helps use puppeteer in scrapy.☆52Updated this week
- Fullstack Web Application Framework With FastAPI + Vite + VueJS. Streamlit for rapid development.☆40Updated 2 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆36Updated last year
- A minimal implementation of GraphRAG, designed to quickly prototype whether you're able to get good sense-making out of a large dataset w…☆32Updated 5 months ago
- Add dependencies specified in requirements.txt file(s) to your Poetry or UV project☆33Updated 3 months ago
- Fully working applications that demonstrate how to use Haystack to implement various use cases☆121Updated 3 months ago
- Add AI intelligence to your tests with AutoFlow☆61Updated last year
- OpenAI compatible API for open source LLMs☆15Updated last year
- a series of tutorials implementing rag service with BentoML and LlamaIndex☆44Updated 6 months ago
- Application configuration and scripts for search on https://docs.vespa.ai/☆12Updated last month
- Chatroom app where messages are sent to GPT, Claude, Mistral, Together, Grok, Groq, Google, vLLM, Ollama & streamed to the frontend.☆40Updated 3 weeks ago
- Automatically pass your funcions defined in Python to ChatGPT have it call them back seemlessly.☆13Updated 2 years ago
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆26Updated 3 months ago
- Sample fastAPI Application to demonstrate OpenTelemetry instrumentation☆14Updated last year
- Transform Oracle PL/SQL Code to Python☆11Updated 11 years ago
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆38Updated 4 months ago
- Spider ported to Python☆87Updated 5 months ago
- This is web app by streamlit, this app scrape facebook page and show some statictic and visualize the date☆26Updated last year
- Scrapy project template. Use it to quickly spin up a new web scraping project☆17Updated 8 months ago
- Scrapy project boilerplate done right☆48Updated 5 months ago
- Experiment on QnA tabular data using LLMs and SQL☆29Updated 8 months ago
- A repository for creating, and sample code for consuming an ONNX embedding model☆32Updated 2 years ago