MovePhilip / webformerLinks
unofficial impelement of the webformer: The Web-page Transformer for Structure Information Extraction
☆13Updated 2 years ago
Alternatives and similar repositories for webformer
Users that are interested in webformer are comparing it to the libraries listed below
Sorting:
- [ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆60Updated last year
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆24Updated 3 years ago
- TianGong-AI-Unstructure☆69Updated this week
- Evaluation for AI apps and agent☆44Updated 2 years ago
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0☆114Updated last year
- RelExt: A Tool for Relation Extraction from Text. 文本实体关系抽取工具。☆51Updated 3 years ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- 本项目由三个 模块构成。意图识别:判断用户的意图是业务型还是闲聊型;模型检索:该部分构建一个语料库,当用户 发起新的query(通过意图识别判断为业务型对话)时,为用户匹配query检索的最佳response,使用HSWN进行召回(粗排), 然后构建句子的相似度,并利用Lig…☆12Updated 4 years ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- 用于微调LLM的中文指令数据集☆28Updated 2 years ago
- 中国知网论文数据集,24000+篇论文信息。自然语言处理、信息管理、文本分类、文本摘要、关键词抽取、研究热点分析、数据挖掘、数据分析☆53Updated 11 months ago
- share data, prompt data , pretraining data☆36Updated 2 years ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆103Updated last year
- aigc evals☆10Updated 2 years ago
- accelerate generating vector by using onnx model☆18Updated 2 years ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆50Updated this week
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Fast pdf translate是一款pdf翻译软件,基于MinerU实现pdf转markdown的功能,接着对markdown进行分割, 送给大模型翻译,最后组装翻译结果并由pypandoc生成结果pdf。☆42Updated 10 months ago
- GoGPT中文指令数据集构造☆10Updated 2 years ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆50Updated last month
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆53Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆49Updated 10 months ago
- The LLM of NL2GQL with NebulaGraph or Neo4j☆97Updated 2 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Updated 2 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated 2 years ago
- 通用简单工具项目☆22Updated last year
- BLOOM 模型的指令微调☆24Updated 2 years ago