axa-group / Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
β5,953Updated last year
Alternatives and similar repositories for Parsr:
Users that are interested in Parsr are comparing it to the libraries listed below
- π‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflowsβ10,758Updated last week
- Build data pipelines, the easy way π οΈβ4,113Updated last year
- An easy way to extract information from documentsβ1,750Updated last year
- Camelot: PDF Table Extraction for Humansβ3,681Updated 2 years ago
- borb is a library for reading, creating and manipulating PDF files in python.β3,467Updated 4 months ago
- A Repo For Document AIβ2,796Updated 2 weeks ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,276Updated last week
- Fuzzy string matching, grouping, and evaluation.β759Updated 2 months ago
- extract text from any document. no muss. no fuss.β4,094Updated 4 months ago
- Fuzzy String Matching in Pythonβ9,257Updated 2 years ago
- A web interface to extract tabular data from PDFsβ1,651Updated 3 months ago
- Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand β¦β1,264Updated 6 months ago
- Build, Manage and Deploy AI/ML Systemsβ8,742Updated this week
- Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheetβ2,443Updated this week
- Text preprocessing, representation and visualization from zero to hero.β2,903Updated last year
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.β4,268Updated 5 months ago
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convertβ¦β20,377Updated last week
- VoilΓ turns Jupyter notebooks into standalone web applicationsβ5,654Updated 2 weeks ago
- The Open Source Feature Store for AI/MLβ5,975Updated this week
- Community maintained fork of pdfminer - we fathom PDFβ6,386Updated last week
- A Python library for reading and writing PDF, powered by QPDFβ2,325Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,371Updated 6 months ago
- Rapid fuzzy string matching in Python using various string metricsβ3,050Updated 2 weeks ago
- Create full-fledged APIs for slowly moving datasets without writing a single line of code.β3,289Updated last month
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β6,065Updated this week
- ZenML π: The bridge between ML and Ops. https://zenml.io.β4,546Updated this week
- A Unified Toolkit for Deep Learning Based Document Image Analysisβ5,201Updated 8 months ago
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,557Updated 7 months ago
- Fuzzy String Matching in Pythonβ3,176Updated last month
- Panel: The powerful data exploration & web app framework for Pythonβ5,161Updated last week