MovePhilip / webformer
Unofficial implementation of WebFormer: The Web-page Transformer for Structure Information Extraction
☆12Updated 2 years ago
Alternatives and similar repositories for webformer
Users that are interested in webformer are comparing it to the libraries listed below
- TianGong-AI-Unstructure☆69Updated 2 months ago
- Evaluation for AI apps and agents☆43Updated last year
- To try TextIn document parsing, visit https://cc.co/16YSIy☆22Updated last year
- TechGPT 2.0: Technology-Oriented Generative Pretrained Transformer 2.0☆113Updated 11 months ago
- This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João…☆71Updated 10 months ago
- General-purpose simple tools project☆20Updated 10 months ago
- GoGPT Chinese instruction dataset construction☆10Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆47Updated 7 months ago
- Chinese pre-trained ModernBert☆83Updated 4 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- ☆13Updated 5 months ago
- Large-scale exact string matching tool☆17Updated 5 months ago
- Automatically push daily news with AI☆13Updated this week
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆104Updated last year
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆48Updated 2 years ago
- A repo for updating and debugging Mixtral-7x8B, MoE, ChatGLM3, LLaMa2, BaChuan, Qwen, and other LLM models, including new models mixtral, mixtral 8x7b, …☆48Updated last week
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆85Updated 7 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆67Updated last year
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Updated last year
- Finance specialized RAG System for the ACM-ICAIF '24 Competition.☆50Updated 8 months ago
- ☆15Updated last year
- The newest version of llama3, with source code explained line by line in Chinese☆22Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆28Updated last month
- Open replication of DeepSeek R1 for text-to-graph extraction.☆98Updated 6 months ago
- ☆34Updated last year
- Line-by-line explanation of Qwen 14B and 7B☆61Updated last year
- ☆37Updated 4 months ago
- All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers☆64Updated last week
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆110Updated 6 months ago
- Evaluation of bm42 sparse indexing algorithm☆68Updated last year