axa-group / ParsrLinks
Transforms PDF, Documents and Images into Enriched Structured Data
☆6,149Updated 2 years ago
Alternatives and similar repositories for Parsr
Users that are interested in Parsr are comparing it to the libraries listed below
Sorting:
- An easy way to extract information from documents☆1,783Updated 2 years ago
- A web interface to extract tabular data from PDFs☆1,787Updated last year
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem…☆2,395Updated 3 months ago
- A Python library to extract tabular data from PDFs☆3,565Updated this week
- A Repo For Document AI☆3,119Updated this week
- Build data pipelines, the easy way 🛠️☆4,145Updated 2 years ago
- Camelot: PDF Table Extraction for Humans☆3,710Updated 3 years ago
- borb is a library for reading, creating and manipulating PDF files in python.☆3,551Updated 2 weeks ago
- 📄 🤖 AI for medical and scientific papers☆1,678Updated 6 months ago
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact…☆1,471Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆329Updated 2 years ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,003Updated last week
- An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data…☆6,549Updated last week
- Create delightful software with Jupyter Notebooks☆5,243Updated last week
- AI code-writing assistant that understands data content☆2,292Updated last year
- 🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.☆3,137Updated this week
- Build, Manage and Deploy AI/ML Systems☆9,702Updated last week
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di…☆1,315Updated last month
- Low-code framework for building custom LLMs, neural networks, and other AI models☆11,634Updated last month
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,809Updated 3 weeks ago
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,743Updated last year
- Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.☆2,050Updated 9 months ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,609Updated 7 months ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,631Updated last year
- Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.☆9,474Updated last week
- Software that makes labeling PDFs easy.☆425Updated last year
- Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand …☆1,370Updated last month
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,151Updated 4 months ago
- Text preprocessing, representation and visualization from zero to hero.☆2,916Updated 2 years ago
- An open source multi-tool for exploring and publishing data☆10,666Updated this week