PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.
☆65Nov 7, 2024Updated last year
Alternatives and similar repositories for MU-GOT
Users that are interested in MU-GOT are comparing it to the libraries listed below.
- Accelerating GOT-OCRv2 with vLLM☆11Nov 15, 2024Updated last year
- Using llama.cpp and onnxruntime to accelerate inference of GOT-OCR2.0☆15Mar 6, 2025Updated last year
- Research on accelerating GOT-OCR for production deployment, in any language☆62Oct 24, 2024Updated last year
- Vary-tiny codebase built upon LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, including English and Chinese)☆86Sep 21, 2024Updated last year
- This repository contains the original Python code for "Spatial Distillation-based Distribution Alignment (SDDA) for Cross-Headset EEG Cla…☆30Nov 17, 2025Updated 4 months ago
- [AAAI 2024] ICMVC: Incomplete Contrastive Multi-View Clustering with High-confidence Guiding☆33Jul 22, 2024Updated last year
- 💡 Awesome RAG: A resource of Retrieval-Augmented Generation (RAG) for LLMs, focusing on the development of technology.☆449Feb 13, 2026Updated last month
- Fair and comprehensive benchmarking for open source EEG foundation models.☆87Feb 6, 2026Updated last month
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆111Oct 9, 2025Updated 5 months ago
- Inference library for sequence-based table recognition, integrating table recognition algorithms such as PP-Structure and modelscope.☆416Sep 4, 2025Updated 6 months ago
- ☆57Jan 23, 2024Updated 2 years ago
- For learning GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,897Dec 30, 2024Updated last year
- A PyTorch implementation of Apple's MobileViT☆20Apr 24, 2023Updated 2 years ago
- Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step☆159Jul 28, 2025Updated 8 months ago
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,100Feb 10, 2025Updated last year
- Pointers to large-scale underwater datasets and relevant resources.☆13May 22, 2025Updated 10 months ago
- This is a repository for ACMMM22 paper "Exploring Effective Knowledge Transfer for Few-shot Object Detection"☆17Jun 21, 2023Updated 2 years ago
- Generates table images via browser rendering☆238Apr 10, 2024Updated last year
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆276Dec 6, 2025Updated 3 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- WIP. Apps (100+) + AI.☆31Sep 2, 2024Updated last year
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 2 months ago
- ☆13Feb 6, 2022Updated 4 years ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆13Aug 2, 2024Updated last year
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- A curated list of personalized alignment resources (continually updated).☆66Mar 12, 2026Updated 2 weeks ago
- Style-Text data synthesis tool☆79Dec 9, 2024Updated last year
- wePoker is a multi-player poker game for Android☆12Mar 20, 2013Updated 13 years ago
- Implementation of Unsupervised Pixel–Level Domain Adaptation with Generative Adversarial Networks by Google☆16Jan 10, 2017Updated 9 years ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,072Apr 14, 2025Updated 11 months ago
- Work related to change detection, including some reproduced code and the author's own work☆10Sep 24, 2019Updated 6 years ago
- ☆14Oct 17, 2024Updated last year
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 9 months ago
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"☆22Mar 30, 2024Updated last year