peakhell / OCRIntegrator
OCRFusion is an integrated solution that combines multiple open-source OCR (Optical Character Recognition) models, layout analysis, and table parsing capabilities. This project unifies these functionalities into a single interface, providing a streamlined and efficient way to process and extract information from various types of documents.
☆16 Updated last year
Alternatives and similar repositories for OCRIntegrator
Users who are interested in OCRIntegrator are comparing it to the repositories listed below
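- gpt_server is an open-source framework for production-grade deployment of LLMs, embedding, reranker, ASR, TTS, text-to-image, image-editing, and text-to-video models. ☆244 Updated this week
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports baichuan, glm, llama, and moss base models; runs chatglm-6B-class models smoothly on mobile devices and reaches 10000+ tokens/s on a single card ☆45 Updated 2 years ago
- Line-by-line explanation of Qwen (千问) 14B and 7B ☆64 Updated 2 years ago
- Aims to train a mini Chinese large language model from scratch that can hold basic conversations; the model size depends on the hardware at hand ☆65 Updated last year
- unify-easy-llm (ULM) aims to be a simple one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models. ☆60 Updated last year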
- Another ChatGLM2 implementation for GPTQ quantization ☆54 Updated 2 years ago
- Based on RapidOCR, extract the PDF content ☆185 Updated 9 months ago
- share data, prompt data, pretraining data ☆36 Updated 2 years ago
- A personal Chinese version of LLama3 ☆39 Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆39 Updated last year
- Python3 package for Chinese/English OCR, using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference based on the ppocr-v5-onnx model; open-source Chinese/English OCR… ☆124 Updated last week
- qwen2 and llama3 cpp implementation ☆49 Updated last year
- qwen models finetuning ☆106 Updated 10 months ago
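- This project uses layoutlmv3 for training and inference on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format; 2. modifying the tokenization of layoutlmv3-base-chinese; 3. splitting text that exceeds a length of 512 and applying a sliding window ☆63 Updated last year
- Document orientation classification ☆222 Updated last year
- Speech recognition based on FunASR, with a standard version and an ONNX version (recommended). ☆48 Updated last year
- Scripts related to bge inference optimization ☆29 Updated 2 years ago
- 📝 Extracts content from document images and outputs it one-to-one to Word or Txt for further use or processing. Planned future support: PDF/image input, with output in JSON, Txt, Word, and Markdown formats. ☆207 Updated last year
- Video understanding: Qwen video multimodal model & Dify ☆66 Updated last year
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, with onnxruntime ☆108 Updated 4 months ago
- The first Chinese llama2 13b model (Base + Chinese dialogue SFT, enabling fluent multi-turn natural-language human-machine interaction) ☆91 Updated 2 years ago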
- ☆175 Updated last year
- llama inference for tencentpretrain ☆99 Updated 2 years ago
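- ChatGLM-6B-Slim: ChatGLM-6B with the 20K image tokens pruned; identical performance with a smaller GPU memory footprint. ☆126 Updated 2 years ago
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser ☆48 Updated last year
- Baidu QA dataset with 1 million entries ☆46 Updated 2 years ago
- Built on the PaddlePaddle platform, this project sets up an innovative multi-model collaborative system for efficient and accurate conversion of PDF files to Markdown files. ☆28 Updated 10 months ago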
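- A high-throughput and memory-efficient inference and serving engine for LLMs ☆140 Updated last year
- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on LLM fine-tuning. ☆36 Updated last year
- ☆28 Updated last year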