shreyshah97 / Newspaper-Segmentation
Newspaper Segmentation into images and text
☆12Updated 6 years ago
Alternatives and similar repositories for Newspaper-Segmentation:
Users that are interested in Newspaper-Segmentation are comparing it to the libraries listed below
- Segmenting text blocks and baselines from documents using deep learning techniques☆13Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆28Updated 3 years ago
- ☆138Updated last year
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated 8 months ago
- Detect handwritten words (neural network based).☆69Updated 3 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- TensorFlow implementation of a segmentation system for document images.☆34Updated 6 years ago
- Python 3 library for processing historical English☆67Updated 8 months ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- LSTM text generation by word. Used to generate multiple sentence suggestions based on the input words or a sentence☆27Updated 4 years ago
- OCR-D-compliant page segmentation☆67Updated last month
- DFKI Layout Detection for OCR-D☆47Updated 2 weeks ago
- ☆22Updated 5 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- ☆11Updated 3 years ago
- ☆10Updated 4 years ago
- Bilingual term extractor☆53Updated last year
- Document processing using transformers☆20Updated 2 years ago
- Keras implementation of character-level sequence-to-sequence learning for spelling correction☆73Updated 6 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆71Updated last year
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last week
- ☆16Updated 4 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Detect textlines in document images☆92Updated 10 months ago
- CVPR 2022: Table Structure Recognition☆39Updated 3 years ago
- A Dense Text Detection model using Receptive Field Blocks☆31Updated 2 years ago
- ☆28Updated 3 years ago
- OCR & Ground Truth Resources☆75Updated 2 years ago