EnkrateiaLucca / ocr_for_transcribing_pdf_slides
Code for my medium article: ["Faster Notes with Python and Deep Learning"](https://medium.com/p/b713bbb3c186/edit)
☆138Updated 3 years ago
Alternatives and similar repositories for ocr_for_transcribing_pdf_slides
Users that are interested in ocr_for_transcribing_pdf_slides are comparing it to the libraries listed below
Sorting:
- 一个相对完整的文档分析和识别项目☆144Updated 5 years ago
- 公式图片ocr,输入图片输出对应的latex表达式☆291Updated 5 years ago
- ☆82Updated 3 years ago
- pretrained models for cnocr☆56Updated 3 years ago
- tech maps☆27Updated 5 years ago
- A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.☆177Updated 2 years ago
- Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.☆57Updated last year
- 使用GAN擦出文档印章 remove stamp by GAN☆157Updated 3 years ago
- CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包☆739Updated 2 months ago
- Recognize tables and text from scanned images that contain tables. 从包含表格的扫描图片中识别表格和文字☆254Updated last year
- Based on RapidOCR, extract the PDF content☆166Updated last week
- ☆255Updated 9 months ago
- DangoOCR: screenshot OCR recognize 文字识别,支持多种语言,识别后翻译,播放声音☆53Updated 4 years ago
- table detect(yolo) , table line(unet)☆250Updated last year
- 利用语言模型,纠正OCR识别错误☆464Updated last year
- ☆594Updated 8 months ago
- Recognize tables from images and restore them into word.☆273Updated last year
- 一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.☆121Updated 3 years ago
- LaTeX OCR 的数据仓库☆119Updated 11 months ago
- 使用python-opencv识别图片中的表格数据转换为csv☆110Updated 5 years ago
- PPOCRLabel is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-…☆202Updated last year
- CCF2019-OCR身份证要素识别-数据生成器☆152Updated 4 years ago
- 文档方向分类☆217Updated 5 months ago
- table structure recognition☆274Updated 2 years ago
- 通过浏览器渲染生成表格图像☆217Updated last year
- 医疗语料库。医疗机构名语料库。药品本位码。☆69Updated last year
- 身份证复印件识别,CCF BDCI参赛项目☆62Updated 5 years ago
- 基于tensorflow、keras/pytorch实现对自然场景的文字检测及端到端的OCR中文文字识别☆102Updated 6 years ago
- 书籍《现代自然语言生成》介绍☆218Updated 4 years ago
- 从NLP出发对于OCR的深度实践集锦,重在实战☆88Updated 4 years ago