CodingMonkey12 / Semantic-Search-using-PaddleLinks
基于Paddle进行语义检索并部署上线,支持多语言 This code is based on Paddle to do a semantic search, and deploy it. Multilingual support
☆13Updated 3 years ago
Alternatives and similar repositories for Semantic-Search-using-Paddle
Users that are interested in Semantic-Search-using-Paddle are comparing it to the libraries listed below
Sorting:
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆47Updated last year
- bge推理优化相关脚本☆29Updated last year
- 文档方向分类☆226Updated 10 months ago
- 基于sentence-transformers实现文本转向量的机器人☆46Updated 3 years ago
- Graph QABot Demo| 图谱问答案例☆14Updated 2 years ago
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆106Updated 2 months ago
- 时间抽取、解析、标准化工具☆55Updated 2 years ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆78Updated 3 months ago
- ☆27Updated 11 months ago
- Based on RapidOCR, extract the PDF content☆181Updated 5 months ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆60Updated last year
- 采用一个模型同时实现问题生成和答案生成☆29Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- 一站式自动化开源标注平台☆78Updated 3 years ago
- 阅读顺序、Layoutreader☆19Updated 5 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆121Updated 3 months ago
- 一个基于预训练的句向量生成工具☆137Updated 2 years ago
- Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor.☆32Updated 2 years ago
- 长文本相似度模型☆21Updated last year
- 检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…☆109Updated 10 months ago
- 介绍docker、docker compose的使用。☆21Updated last year
- FinanceEventGraph,金融领域事件图谱开放数据集,可用于事件图谱搭建于实验,包括3865个acquire并购事件、9093个invest投资事件,总计12960的事件☆20Updated last year
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆24Updated 2 years ago
- 智能文本自动处理工具(Intelligent text automatic processing tool)。AutoText的功能主要有文本纠错,图片ocr、版面检测以及表格结构识别等。The main functions of this project include …☆27Updated 2 years ago
- 🌳CED: Catalog Extraction from Documents☆16Updated 2 years ago
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding☆226Updated last year
- Minimal keyword extraction with BERT☆88Updated 3 years ago
- benchmark of KgCLUE, with different models and methods☆28Updated 3 years ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆302Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week