axa-group / ParsrLinks
Transforms PDF, Documents and Images into Enriched Structured Data
☆6,008Updated last year
Alternatives and similar repositories for Parsr
Users that are interested in Parsr are comparing it to the libraries listed below
Sorting:
- An easy way to extract information from documents☆1,776Updated 2 years ago
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact…☆1,455Updated 9 months ago
- Build data pipelines, the easy way 🛠️☆4,142Updated 2 years ago
- borb is a library for reading, creating and manipulating PDF files in python.☆3,523Updated 2 weeks ago
- AI code-writing assistant that understands data content☆2,289Updated last year
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,397Updated last week
- A Repo For Document AI☆2,957Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,691Updated last week
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,488Updated last year
- extract text from any document. no muss. no fuss.☆4,281Updated 9 months ago
- Create delightful software with Jupyter Notebooks☆5,184Updated last month
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,601Updated 3 months ago
- A Python library to extract tabular data from PDFs☆3,421Updated last week
- 🦘 Explore multimedia datasets at scale☆1,064Updated 9 months ago
- Camelot: PDF Table Extraction for Humans☆3,700Updated 2 years ago
- A web interface to extract tabular data from PDFs☆1,706Updated 8 months ago
- Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.☆2,023Updated 6 months ago
- App to easily query, script, and visualize data from every database, file, and API.☆2,944Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆327Updated last year
- Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeli…☆4,400Updated this week
- Build and share data reports in 100% Python☆1,401Updated last year
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engin…☆4,238Updated 7 months ago
- Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet☆2,525Updated this week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,900Updated 3 weeks ago
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆5,550Updated this week
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di…☆1,307Updated 3 weeks ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.☆1,942Updated 2 months ago
- Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand …☆1,337Updated last week
- Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.☆3,854Updated last year
- Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation☆3,200Updated 2 months ago