CodingMonkey12 / Semantic-Search-using-PaddleLinks
基于Paddle进行语义检索并部署上线,支持多语言 This code is based on Paddle to do a semantic search, and deploy it. Multilingual support
☆13Updated 3 years ago
Alternatives and similar repositories for Semantic-Search-using-Paddle
Users that are interested in Semantic-Search-using-Paddle are comparing it to the libraries listed below
Sorting:
- Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor.☆32Updated 2 years ago
- 基于sentence-transformers实现文本转向量的机器人☆46Updated 3 years ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆48Updated last year
- 供AI训练的中文数据集(持续更新。。。)与AI公司图谱,目前的 数据集餐饮行业8000问,百度知道,Alpaca中文数据集,计算机领域数据集,Vicuna数据集,RedPajama数据集,Wikipedia中文词条数据集,网站论坛问答数据集☆62Updated 2 years ago
- bge推理优化相关脚本☆29Updated last year
- 时间抽取、解析、标准化工具☆55Updated 3 years ago
- Graph QABot Demo| 图谱问答案例☆15Updated 2 years ago
- 一个基于预训练的句向量生成工具☆138Updated 2 years ago
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆116Updated 4 months ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆85Updated 5 months ago
- 利用文本分析算法和Python脚本,自动纠正word中的英语单词拼写错误☆48Updated 7 years ago
- ☆27Updated last year
- ChatGLM-6B fine-tuning.☆137Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆116Updated last year
- 采用一个模型同时实现问题生成和答案生成☆29Updated 3 years ago
- A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。☆68Updated 4 years ago
- Reproduction paper --- PDFTriage : Question Answering over Long, Structured Documents☆43Updated last year
- Ziya-LLaMA-13B是IDEA基于LLaMa的130亿参数的大规模预训练模型,具备翻译,编程,文本分类,信息抽取,摘要,文案生成,常识问答和数学计算等能力。目前姜子牙通用大模型已完成大规模预训练、多任务有监督微调和人类反馈学习三阶段的训练过程。本文主要用于Ziya-…☆45Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆120Updated last year
- 文档方向分类☆225Updated last year
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆123Updated 6 months ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆43Updated 11 months ago
- 介绍docker、docker compose的使用。☆21Updated last year
- The code and data for GrammarGPT.☆178Updated 2 years ago
- 一站式自动化开源标注平台☆78Updated 3 years ago
- 支持ChatGLM2 lora微调☆41Updated 2 years ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆24Updated 3 years ago
- 🌳CED: Catalog Extraction from Documents☆16Updated 2 years ago
- 大语言模型ChatGLM-6B为基座,接入文档阅读功能进行实时问答,可上传txt/docx/pdf多种文件类型。☆43Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆95Updated 9 months ago