axa-group / Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
β5,936Updated last year
Alternatives and similar repositories for Parsr:
Users that are interested in Parsr are comparing it to the libraries listed below
- An open source multi-tool for exploring and publishing dataβ9,866Updated this week
- Build data pipelines, the easy way π οΈβ4,113Updated last year
- An easy way to extract information from documentsβ1,739Updated last year
- A Python library to extract tabular data from PDFsβ3,196Updated this week
- An open source alternative to Tableau. Embeddable visual analyticβ2,750Updated last week
- A web interface to extract tabular data from PDFsβ1,638Updated 2 months ago
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,076Updated this week
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagemβ¦β2,108Updated last month
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,554Updated 5 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.β4,390Updated this week
- borb is a library for reading, creating and manipulating PDF files in python.β3,451Updated 3 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,366Updated this week
- An open-source, low-code machine learning library in Pythonβ9,197Updated last week
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved diβ¦β1,279Updated last week
- extract text from any document. no muss. no fuss.β3,994Updated 3 months ago
- Automatically visualize your pandas dataframe via a single print! π π‘β5,259Updated 11 months ago
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifactβ¦β1,427Updated 3 months ago
- Merlion: A Machine Learning Framework for Time Series Intelligenceβ4,209Updated 8 months ago
- Create delightful software with Jupyter Notebooksβ5,039Updated last week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.β6,676Updated this week
- Declarative visualization library for Pythonβ9,637Updated this week
- VoilΓ turns Jupyter notebooks into standalone web applicationsβ5,615Updated last week
- Postgres with GPUs for ML/AI apps.β6,189Updated 2 weeks ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.β2,231Updated 2 years ago
- Production infrastructure for machine learning at scaleβ8,030Updated 9 months ago
- Streaming replication for SQLite.β11,532Updated 3 months ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,354Updated 5 months ago
- Camelot: PDF Table Extraction for Humansβ3,677Updated 2 years ago
- π Parameterize, execute, and analyze notebooksβ6,106Updated 2 months ago
- Low-code framework for building custom LLMs, neural networks, and other AI modelsβ11,362Updated last week