tianchiguaixia / medical_ocr_streamlit

This project recognizes table data in images, extracts it, and exports it as CSV files. The whole project is deployed and presented with Streamlit. Technologies used: paddleocr, PPStructure, streamlit

☆34

Alternatives and similar repositories for medical_ocr_streamlit

Users that are interested in medical_ocr_streamlit are comparing it to the libraries listed below

xhw205 / PaddleOCR_AlignText
Row alignment of PaddleOCR output; OCR row alignment for table-format images
☆44 Updated 3 years ago
tianchiguaixia / medical_records_extract
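This project extracts key information from medical record files and displays the extracted content in a Streamlit front end. Currently supported file types: images, PDF files, and Word files
☆23 Updated 2 years ago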
RapidAI / RapidOrientation
Document orientation classification
☆219 Updated 7 months ago
zhangnn520 / znn_chatglm
Building NLP that everyone can use. Open source isn't easy, so remember to star!
☆101 Updated 2 years ago
wxwwt / opencv-picture-to-excel
Uses python-opencv to recognize table data in images and convert it to CSV
☆111 Updated 5 years ago
tianchiguaixia / layoutlmv3-chinese
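This project uses layoutlmv3 for training and inference on Chinese images. It addresses three main problems: 1. normalizing data into a usable training dataset format; 2. modifying the layoutlmv3-base-chinese tokenization; 3. splitting text longer than 512 and applying a sliding window
☆48 Updated 9 months ago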
WangRongsheng / PaddleOCR-Flask-deploy
✅ Deploy PaddleOCR with Flask for easy invocation
☆41 Updated 3 years ago
OKC13 / General-Documents-Layout-parser
General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser
☆46 Updated last year
hiroi-sora / GapTree_Sort_Algorithm
[Gap Tree Sort Algorithm] Performs layout analysis on OCR results or text extracted from PDFs, sorting it in human reading order.
☆140 Updated last year
pyunits / pyunit-ner
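NER entity recognition model: fast, efficient, and simple, with one-click Docker deployment for calling the model. Recognizes address, person name, and organization name entities.
☆36 Updated last year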
tianchiguaixia / qwen1.5-ner
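Fine-tunes the Qwen1.5-0.5B-Chat model for general information extraction, aiming to: verify how generative methods compare with extractive NER; give beginners a simple model fine-tuning workflow with as little code as possible; and handle data formatting for large-model training.
☆13 Updated 9 months ago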
RapidAI / RapidTableDetection
Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…
☆98 Updated 6 months ago
cuiwang / MarkStudio
Chinese annotation tool supporting NER, text classification, relation annotation, dialogue annotation, and more.
☆76 Updated 10 months ago
wp931120 / LongChainKBQA
kbqa, langchain, large language model, chatgpt
☆81 Updated 8 months ago
bitdata / ocrtable
Recognize tables and text from scanned images that contain tables.
☆256 Updated 2 years ago
duanyu / LabelFast
An open-source NLP auto-labeling tool for the Chinese-speaking world. Leave the simple samples to LabelFast.
☆73 Updated 5 months ago
pyunits / pyunit-time
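A simple, easy-to-use Python module for manipulating dates and times through strings. Supports regex-based time extraction, string time parsing, and string time extraction, including extracting Chinese time expressions from a sentence.
☆75 Updated 11 months ago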
taishan1994 / Qwen2-UIE
General information extraction (entity/relation/event extraction) based on the Qwen2 model
☆31 Updated 11 months ago
Kratosssssss / ChatGLM2-Langchain-Agent-demo
☆21 Updated last year
jiangnanboy / layout_analysis
Chinese layout detection; YOLOv8 is used to detect the layout of Chinese document images.
☆58 Updated 2 years ago
Vincent131499 / Chinese-OCR3
A collection of in-depth OCR practice from an NLP perspective, with a focus on hands-on work
☆89 Updated 4 years ago
WenmuZhou / TableGeneration
Generates table images via browser rendering
☆224 Updated last year
ck-unifr / pdf_parsing
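PDF parsing (text, sections, tables, images, references); PDF question answering, summarization, and information extraction based on large models (ChatGLM2-6B, RWKV) + langchain + streamlit
☆202 Updated last year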
MoranCoder95 / pipeline-ChatGLM
A pipeline system for building ChatGLM question answering over a local knowledge base
☆87 Updated 2 years ago
chatchat-space / chatchat-knowledgebase
The official account for everyone, "查特查特" (Chatchat), is now live! New problems, new methods, new discoveries; PRs are welcome!
☆45 Updated last year
shibing624 / imgocr
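Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model (~14MB). Inference is based on the ppocr-v4-onnx model and achieves millisecond-level, accurate OCR prediction on CPU; in general scenarios, Chinese/English OCR reaches open-source SO…
☆87 Updated 5 months ago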
JIN-strong / Table-OCR-based-on-DeepLearning
Table detection and table structure recognition
☆22 Updated 2 years ago
RapidAI / RapidOCRPDF
Based on RapidOCR, extract the PDF content
☆172 Updated last month
lingerun / KGQA_insurance_product
An insurance knowledge graph and simple question-answering system built on open-source insurance product data
☆38 Updated 5 years ago
polygraph8 / 5minNLP
Five-minute NLP knowledge
☆43 Updated 4 years ago