Tencent / POINTS-Reader
☆149Updated 2 weeks ago
Alternatives and similar repositories for POINTS-Reader
Users that are interested in POINTS-Reader are comparing it to the libraries listed below
- Deep Reasoning Translation (DRT) Project☆233Updated last month
- GLM Series Edge Models☆149Updated 3 months ago
- ☆185Updated 8 months ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆146Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆251Updated last month
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆154Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆59Updated 2 weeks ago
- A simple MLLM that surpasses QwenVL-Max using only open-source data, built on a 14B LLM.☆38Updated last year
- ☆293Updated 4 months ago
- Delta-CoMe achieves near-lossless 1-bit compression; accepted at NeurIPS 2024☆57Updated 10 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆183Updated last week
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 7 months ago
- PDF data of Chinese papers, securities documents, and financial reports☆34Updated last year
- ☆141Updated last month
- To try TextIn document parsing, visit https://cc.co/16YSIy☆21Updated last year
- Exploring deployment acceleration for the GOT-OCR project, not limited to any language☆62Updated 11 months ago
- ☆488Updated this week
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆202Updated last week
- An open-source LLM based on an MoE structure.☆58Updated last year
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 4 months ago
- ☆239Updated 7 months ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆256Updated last week
- Handwritten Text Recognition and Character Detection☆160Updated last week
- ☆79Updated last year
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆80Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI.☆89Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆222Updated 3 months ago
- ☆29Updated last year
- ☆57Updated last year