peakhell / OCRIntegratorLinks
OCRFusion is an integrated solution that combines multiple open-source OCR (Optical Character Recognition) models, layout analysis, and table parsing capabilities. This project unifies these functionalities into a single interface, providing a streamlined and efficient way to process and extract information from various types of documents.
☆16Updated 10 months ago
Alternatives and similar repositories for OCRIntegrator
Users that are interested in OCRIntegrator are comparing it to the libraries listed below
Sorting:
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆35Updated 5 months ago
- bge推理优化相关脚本☆28Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 5 months ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated 11 months ago
- 多显卡部署版 | ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model☆62Updated 2 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆53Updated last year
- ☆90Updated last year
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆61Updated 7 months ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆108Updated last year
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆69Updated 2 months ago
- qwen2 and llama3 cpp implementation☆44Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆184Updated last week
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆17Updated 8 months ago
- Imitate OpenAI with Local Models☆87Updated 9 months ago
- 千问14B和7B的逐行解释☆60Updated last year
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆59Updated 9 months ago
- 大语言模型训练和服务调研☆37Updated last year
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆48Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 6 months ago
- Qwen-Efficient-Tuning☆43Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆82Updated 4 months ago
- 高性能文本 Tokenizer 库☆29Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆90Updated last year
- LLama3中文个人版本☆39Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated 5 months ago
- NLP 项目记录档案☆54Updated last month
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆34Updated 10 months ago
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆93Updated last year