axa-group / ParsrLinks
Transforms PDF, Documents and Images into Enriched Structured Data
☆6,009Updated last year
Alternatives and similar repositories for Parsr
Users that are interested in Parsr are comparing it to the libraries listed below
Sorting:
- An easy way to extract information from documents☆1,777Updated 2 years ago
- A Repo For Document AI☆2,951Updated 2 weeks ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,470Updated last year
- Build data pipelines, the easy way 🛠️☆4,143Updated 2 years ago
- borb is a library for reading, creating and manipulating PDF files in python.☆3,523Updated last week
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact…☆1,454Updated 9 months ago
- A web interface to extract tabular data from PDFs☆1,700Updated 8 months ago
- Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.☆2,020Updated 5 months ago
- extract text from any document. no muss. no fuss.☆4,281Updated 9 months ago
- Community maintained fork of pdfminer - we fathom PDF☆6,687Updated 4 months ago
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem…☆2,153Updated 2 months ago
- Camelot: PDF Table Extraction for Humans☆3,701Updated 2 years ago
- A machine learning software for extracting information from scholarly documents☆4,297Updated this week
- Wasm powered Jupyter running in the browser 💡☆4,492Updated last week
- 🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.☆3,131Updated this week
- A Python library to extract tabular data from PDFs☆3,415Updated this week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,602Updated 3 months ago
- Create delightful software with Jupyter Notebooks☆5,179Updated last month
- 📚 Parameterize, execute, and analyze notebooks☆6,256Updated 2 months ago
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,613Updated 4 months ago
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,524Updated last year
- An open source multi-tool for exploring and publishing data☆10,314Updated last month
- Software that makes labeling PDFs easy.☆420Updated last year
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,246Updated 3 years ago
- Convert Jupyter Notebooks to Web Apps☆4,262Updated 3 months ago
- Voilà turns Jupyter notebooks into standalone web applications☆5,803Updated last week
- App to easily query, script, and visualize data from every database, file, and API.☆2,941Updated last year
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows☆11,520Updated last week
- A Python library for reading and writing PDF, powered by QPDF☆2,454Updated 3 weeks ago
- Programmatically collect normalized news from (almost) any website.☆2,968Updated 4 years ago