soham-1 / fastapi_pdfextractorLinks
An api using fastapi for extracting the text content of pdf using pdfminer. It also supports scanned images in pdf's by using tesseract and ocrmypdf.
☆15Updated 3 years ago
Alternatives and similar repositories for fastapi_pdfextractor
Users that are interested in fastapi_pdfextractor are comparing it to the libraries listed below
Sorting:
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated last year
- FastAPI Async MongoDB Boiler Plate RestAPI☆39Updated last year
- FastAPI-PostgreSQL-Celery-RabbitMQ-Redis bakcend with Docker containerization☆73Updated last year
- This is simple REST API project using a modern stack with FastAPI. (Celery, Redis, Postgres, SQLAlchemy, Docker, Docker Compose)☆40Updated 2 years ago
- A simple docker-compose app for orchestrating a fastapi application, a celery queue with rabbitmq(broker) and redis(backend)☆134Updated 2 years ago
- Production ready boilerplate to start with Fastapi☆27Updated 3 years ago
- FastAPI with Docker and Traefik☆112Updated 2 years ago
- Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings f…☆13Updated 9 months ago
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆37Updated 2 months ago
- Application configuration and scripts for search on https://docs.vespa.ai/☆12Updated this week
- ☆28Updated last year
- The faststream-gen library uses advanced AI to generate FastStream code from user descriptions, speeding up FastStream app development.☆48Updated last year
- T.I.M.E: Thoroughly Intelligent Mail Explorer" Repo to try and build an incredible RAG system over email (this is to test the SOTA in RAG…☆21Updated 4 months ago
- use GPT3 to generate SQL from text☆13Updated 2 months ago
- ☆13Updated 9 months ago
- Self-host llmapi server, make it really easy for accessing LLMs !☆37Updated 2 years ago
- ☆56Updated last year
- A pattern to let you try several vector databases and change a little code as possible☆38Updated last year
- ☆55Updated 2 years ago
- Opinionated Langchain setup with Qdrant vector store and Kong gateway☆33Updated 2 years ago
- Extracting structured JSON from credit card statements using Langchain and Pydantic☆23Updated 11 months ago
- Minimal example utilizing Fastapi and celery with Redis for celery back-end and task queue, and flower for monitoring the celery tasks.☆69Updated last year
- Python SDK Client for ZincSearch☆11Updated 2 years ago
- Fullstack Web Application Framework With FastAPI + Vite + VueJS. Streamlit for rapid development.☆40Updated last year
- Sample project showing reliable data ingestion application using FastAPI and dramatiq☆44Updated 3 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆50Updated 2 months ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.☆40Updated last year
- Experiment on QnA tabular data using LLMs and SQL☆29Updated 7 months ago
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆26Updated 2 months ago