CodingMonkey12 / Semantic-Search-using-PaddleLinks
基于Paddle进行语义检索并部署上线,支持多语言 This code is based on Paddle to do a semantic search, and deploy it. Multilingual support
☆12Updated 2 years ago
Alternatives and similar repositories for Semantic-Search-using-Paddle
Users that are interested in Semantic-Search-using-Paddle are comparing it to the libraries listed below
Sorting:
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated last year
- bge推理优化相关脚本☆28Updated last year
- ☆27Updated 8 months ago
- 介绍docker、docker compose的使用。☆20Updated 9 months ago
- 基于sentence-transformers实现文本转向量的机器人☆46Updated 2 years ago
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆70Updated 3 months ago
- 🌳CED: Catalog Extraction from Documents☆16Updated last year
- ☆15Updated last year
- 时间抽取、解析、标准化工具☆52Updated 2 years ago
- 文本纠错(Text Correct, CSC), 支持中文文本纠错(拼写纠错/标点符号纠错/繁体纠错)(CSC, Chinese Spelling Correct / Check; Punct), CSC支持各领域数据的中文文本纠错(包括古文), 模型在大规模、各领域的、现…☆26Updated 3 weeks ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆48Updated 9 months ago
- FinCUGE Instruction dataset☆12Updated 2 years ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆23Updated 2 years ago
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆45Updated 3 months ago
- 微调阿里开源的文字检测模型,利用合合识别返回的OCR结果作为初始训练数据,对模型进行优化训练,使其更加适应1万张图片的具体场景,提高文字识别的精度。☆9Updated 6 months ago
- 智能文本自动处理工具(Intelligent text automatic processing tool)。AutoText的功能主要有文本纠错,图片ocr、版面检测以及表格结构识别等。The main functions of this project include …☆25Updated 2 years ago
- A Multi-Modal Dataset of Chinese Governmental Docunments☆34Updated 4 years ago
- 支持ChatGLM2 lora微调☆40Updated last year
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测 尝试。☆13Updated 2 years ago
- Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆40Updated last year
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆44Updated last year
- Based on RapidOCR, extract the PDF content☆172Updated last month
- Here is a demo for PDF parser (Including OCR, object detection tools)☆35Updated 8 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated 11 months ago
- TianGong-AI-Unstructure☆67Updated last week
- Ziya-LLaMA-13B是IDEA基于LLaMa的130亿参数的大规模预训练模型,具备翻译,编程,文本分类,信息抽取,摘要,文案生成,常识问答和数学计算等能力。目前姜子牙通用大模型已完成大规模预训练、多任务有监督微调和人类反馈学习三阶段的训练过程。本文主要用于Ziya-…☆45Updated 2 years ago
- aigc evals☆10Updated last year
- Graph QABot Demo| 图谱问答案例☆15Updated 2 years ago
- 文档方向分类☆219Updated 7 months ago
- 在kaggle部署ChatGLM API,和ChatGPT api使用相同的调用方式☆14Updated last year