Tanguy9862 / Medical-OCR-Data-ExtractionLinks
An object-oriented Python script for extracting structured data from medical documents. Successfully processed 2,000+ files, combining OCR technology to output clean datasets for analytics. Includes collaboration with medical professionals and statistical analysis via RMarkdown.
☆8Updated 4 months ago
Alternatives and similar repositories for Medical-OCR-Data-Extraction
Users that are interested in Medical-OCR-Data-Extraction are comparing it to the libraries listed below
Sorting:
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆52Updated 7 months ago
- ☆17Updated 4 years ago
- ☆18Updated 2 years ago
- A Data Centric NER annotation tool for your Named Entity Recognition projects☆47Updated last year
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆80Updated 2 years ago
- Leverages extensive power of multiple Machine Learning algorithms & LLM to provide in-depth answers to medical queries and predicts condi…☆47Updated last year
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 4 years ago
- This PyTorch implementation of LayoutLM paper by Microsoft demonstrate the SequenceClassfication task using HuggingFaceTransformers to cl…☆34Updated 2 years ago
- Code repository for my tutorial series which covers beginner to advanced topics on OpenCV with Python.☆23Updated last month
- Working codes for project☆23Updated last year
- A function that takes as input a cropped text line image, and outputs the dewarped image.☆18Updated 7 months ago
- A toolkit for training CNN-1DRNN-CTC model to perform line-level Handwritten Text Recognition☆9Updated 5 years ago
- IAM dataset☆59Updated 2 years ago
- A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping☆106Updated 2 years ago
- ☆15Updated 2 years ago
- ☆10Updated 2 months ago
- ☆16Updated 5 months ago
- Spacy, HAC, pytesseract, easyocr, doctr, mmocr, layoutlm, paddleocr☆20Updated last year
- The Llama-2-GGML-CSV-Chatbot is a conversational tool leveraging the powerful Llama-2 7B language model. It facilitates multi-turn intera…☆11Updated last month
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Updated last year
- Detect handwritten words (neural network based).☆70Updated 3 years ago
- ☆22Updated last year
- Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"☆15Updated 3 years ago
- ☆33Updated 4 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆58Updated 8 months ago
- Used for reading medical prescription and coverting it in digital form☆114Updated 6 years ago
- ☆17Updated 10 months ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆25Updated 6 months ago
- A simple document detector in python3☆51Updated 2 years ago
- ☆35Updated 4 years ago