axa-group / Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
☆5,941Updated last year
Alternatives and similar repositories for Parsr:
Users that are interested in Parsr are comparing it to the libraries listed below
- An easy way to extract information from documents☆1,744Updated last year
- Camelot: PDF Table Extraction for Humans☆3,678Updated 2 years ago
- Build data pipelines, the easy way 🛠️☆4,114Updated last year
- An open source multi-tool for exploring and publishing data☆9,906Updated this week
- A Repo For Document AI☆2,764Updated this week
- A web interface to extract tabular data from PDFs☆1,646Updated 2 months ago
- 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows☆10,660Updated this week
- App to easily query, script, and visualize data from every database, file, and API.☆2,923Updated last year
- Wasm powered Jupyter running in the browser 💡☆4,067Updated this week
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,122Updated this week
- Text preprocessing, representation and visualization from zero to hero.☆2,905Updated last year
- AI code-writing assistant that understands data content☆2,252Updated last year
- Free, open-source SQL client for Windows and Mac 🦅☆5,125Updated last year
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,508Updated 6 months ago
- Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet☆2,365Updated this week
- Links to awesome OCR projects☆2,938Updated 8 months ago
- Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame☆2,244Updated 3 months ago
- A desktop application for viewing and analyzing tabular data☆3,276Updated 3 weeks ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,397Updated last week
- 📄 🤖 Semantic search and workflows for medical/scientific papers☆1,388Updated 3 months ago
- Create delightful software with Jupyter Notebooks☆5,047Updated this week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,559Updated 6 months ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,251Updated 4 months ago
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di…☆1,284Updated 2 weeks ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,357Updated 5 months ago
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆4,993Updated this week
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,233Updated 2 years ago
- Containers for machine learning☆8,506Updated this week
- A terminal spreadsheet multitool for discovering and arranging data☆8,127Updated 2 weeks ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,667Updated last year