KaixuanZ / PR1956
☆13 Updated 4 years ago
Related projects
Alternatives and complementary repositories for PR1956
- Code and data for the paper at http://arxiv.org/abs/2004.07317 ☆16 Updated 4 years ago
- A model(ing framework) for sample efficient OCR ☆53 Updated last year
- A Large Dataset of Historical Japanese Documents with Complex Layouts ☆32 Updated 2 years ago
- A dataset of region-annotated scientific articles. ☆20 Updated 4 years ago
- Noise-robust de-duplication at scale ☆15 Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆35 Updated 11 months ago
- ☆55 Updated 3 years ago
- dev repo for article ☆25 Updated last year
- Code required to implement various approaches to historical record linking ☆18 Updated 4 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT ☆70 Updated 3 weeks ago
- A shared repository for data cleaning scripts used for innovation data. ☆29 Updated 3 years ago
- US utility patent similarity data creation and analysis tools ☆26 Updated 4 years ago
- baselines for DocVQA dataset ☆20 Updated 3 years ago
- OCR & Ground Truth Resources ☆74 Updated 2 years ago
- ☆37 Updated 3 years ago
- dhSegment on pytorch ☆32 Updated last year
- This package consists of functionalities for dynamic topic modelling and its visualization ☆24 Updated 4 years ago
- ☆30 Updated 4 months ago
- This is an OCR solution for receipts, invoices, etc. ☆20 Updated 4 years ago
- ☆25 Updated 4 years ago
- Fast, flexible name matching for large datasets ☆70 Updated 11 months ago
- Course page for KU course on text data and deep learning https://kurser.ku.dk/course/a%c3%98kk08401u/2019-2020 ☆9 Updated 4 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction ☆28 Updated 5 years ago
- Slides and Jupyter notebooks for course on text analysis and machine learning for social science ☆24 Updated 3 years ago
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings" ☆66 Updated 2 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020" ☆45 Updated 4 years ago
- ☆39 Updated 3 years ago
- A simple toolkit for conducting analyses using corpus methods ☆24 Updated 3 years ago
- ☆15 Updated 7 years ago
- AI_DocumentLayoutAnalysis ☆38 Updated 4 years ago