liunian-Jay / MU-GOT
pdf 解析
☆31Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for MU-GOT
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆129Updated 5 months ago
- 个人项目地址,一些大语言模型和多模态模型的应用☆123Updated 2 weeks ago
- ☆156Updated 8 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆199Updated last month
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆68Updated 2 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆150Updated 2 weeks ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆242Updated 2 months ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆99Updated last month
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆164Updated last month
- ☆67Updated this week
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆139Updated last week
- ☆55Updated 10 months ago
- 文档方向分类☆203Updated this week
- 中文原生检索增强生成测评基准☆100Updated 7 months ago
- 源自PP-Structure的表格识别算法,模型转换为ONNX,推理引擎采用ONNXRuntime,部署简单,无内存泄露问题。☆79Updated last week
- Document Artifical Intelligence☆131Updated last month
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆155Updated 8 months ago
- ☆127Updated 9 months ago
- A Toolkit for Table-based Question Answering☆105Updated last year
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆76Updated this week
- Train a 1B LLM with 1T tokens from scratch by personal☆347Updated last week
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆69Updated 2 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆53Updated 3 weeks ago
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆142Updated last year
- ☆106Updated 9 months ago
- The official PyTorch implementation of SEMv3.☆27Updated 5 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆174Updated this week
- LongQLoRA: Extent Context Length of LLMs Efficiently☆159Updated last year
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆269Updated 3 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆59Updated last week