OKC13 / General-Documents-Layout-parser
General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser
☆45Updated 5 months ago
Related projects
Alternatives and complementary repositories for General-Documents-Layout-parser
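- This project uses layoutlmv3 for training and inference on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format; 2. modifying tokenization for layoutlmv3-base-chinese; 3. splitting text longer than 512 and applying a sliding window.☆33Updated 2 months ago
- A native-Chinese benchmark for evaluating retrieval-augmented generation☆100Updated 7 months ago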
- ☆26Updated 3 weeks ago
- ☆21Updated last month
- An introduction to using docker and docker compose.☆20Updated 2 months ago
- The newest version of llama3, with source code explained line by line in Chinese☆22Updated 7 months ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆46Updated last year
- TianGong-AI-Unstructure☆51Updated this week
- use chatGLM to perform text embedding☆45Updated last year
- ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune☆106Updated last year
- A tool for time expression extraction, parsing, and normalization☆49Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆76Updated last year
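- Scripts for bge inference optimization☆24Updated 9 months ago
- (1) A rotary position embedding encoder with elastic-interval normalization, plus peft LORA quantized training, to improve performance on inputs of tens of thousands of tokens. (2) Evidence-theory explanation learning to improve the model's complex logical reasoning. (3) Compatible with the alpaca data format.☆45Updated last year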
- code for piccolo embedding model from SenseTime☆110Updated 6 months ago
- LLM for NER☆54Updated 3 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 7 months ago
- (NBCE) Naive Bayes-based Context Extension on ChatGLM-6b☆14Updated last year
- A tool for training sentence embeddings, supporting Cosent and Simcse☆17Updated 2 months ago
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute☆241Updated 2 months ago
- ☆60Updated 2 months ago
- Easy-to-use and Fast NLP library with awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications.☆11Updated 8 months ago
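- A demonstration of how effective vllm is with Chinese large language models☆31Updated last year
- A bot that converts text to vectors, built on sentence-transformers☆45Updated 2 years ago
- Luotuo QA (骆驼QA), a Chinese large language model for reading comprehension.☆72Updated last year
- Text deduplication☆67Updated 5 months ago
- GTS Engine: A powerful NLU Training System. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; it can automatically produce NLP models from only a small number of samples.☆89Updated last year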
- SmartSearch: Building a quick conversation-based search engine with LLMs.☆42Updated 6 months ago