axa-group / Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
☆5,855 · Updated 11 months ago
Related projects
Alternatives and complementary repositories for Parsr
- A Repo For Document AI ☆2,593 · Updated this week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,513 · Updated 2 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆3,997 · Updated this week
- An easy way to extract information from documents ☆1,717 · Updated last year
- Build data pipelines, the easy way 🛠️ ☆4,079 · Updated last year
- Convert Jupyter Notebooks to Web Apps ☆4,052 · Updated 4 months ago
- A web interface to extract tabular data from PDFs ☆1,593 · Updated 6 months ago
- App to easily query, script, and visualize data from every database, file, and API. ☆2,903 · Updated last year
- Postgres with GPUs for ML/AI apps. ☆6,039 · Updated this week
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. ☆3,898 · Updated this week
- A Python library to extract tabular data from PDFs ☆3,029 · Updated 3 months ago
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di… ☆1,262 · Updated 3 weeks ago
- A machine learning software for extracting information from scholarly documents ☆3,590 · Updated this week
- Build and share data reports in 100% Python ☆1,381 · Updated last year
- PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis ☆13,401 · Updated this week
- An open source alternative to Tableau. Embeddable visual analytic ☆2,528 · Updated 2 weeks ago
- Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand … ☆1,219 · Updated last month
- Automatically visualize your pandas dataframe via a single print! 📊 💡 ☆5,213 · Updated 8 months ago
- Camelot: PDF Table Extraction for Humans ☆3,666 · Updated last year
- 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows ☆9,460 · Updated this week
- Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables. ☆6,770 · Updated last week
- Grist is the evolution of spreadsheets. ☆7,279 · Updated this week
- ♾️ CML - Continuous Machine Learning | CI/CD for ML ☆4,044 · Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,470 · Updated 8 months ago
- Next generation of automated data exploratory analysis and visualization platform. ☆4,277 · Updated 3 months ago
- 🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more. ☆3,104 · Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ☆5,706 · Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak… ☆16,242 · Updated this week
- Dolt – Git for Data ☆17,981 · Updated this week
- Always know what to expect from your data. ☆10,004 · Updated this week