opendatalab / mineru-vl-utilsView external linksLinks
A Python package for interacting with the MinerU Vision-Language Model.
☆103Feb 5, 2026Updated last week
Alternatives and similar repositories for mineru-vl-utils
Users that are interested in mineru-vl-utils are comparing it to the libraries listed below
Sorting:
- 阅读顺序、Layoutreader☆19May 8, 2025Updated 9 months ago
- DELT: Data Efficacy for Language Model Training☆43Updated this week
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆455Sep 28, 2025Updated 4 months ago
- ☆11Oct 31, 2024Updated last year
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Mo…☆12Jul 6, 2024Updated last year
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 10 months ago
- ☆18Feb 16, 2025Updated last year
- ☆22Dec 11, 2025Updated 2 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆275Dec 6, 2025Updated 2 months ago
- 人工智能与深度学习实战 - 深度学习篇☆15Nov 8, 2025Updated 3 months ago
- Use to store public paper and organize them.☆18Feb 26, 2021Updated 4 years ago
- Long Context Research☆26Jan 26, 2026Updated 2 weeks ago
- ☆21Jun 16, 2025Updated 8 months ago
- An open-source Agent Skill framework implementing progressive disclosure architecture☆40Jan 30, 2026Updated 2 weeks ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 9 months ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- Implementation (in progress) of Dieng et al.'s TopicRNN intended to be used as a baseline and starting point.☆10Jun 26, 2018Updated 7 years ago
- 工业级中文语音识别系统电子书☆13Oct 30, 2020Updated 5 years ago
- 밑바닥부터 시작하는 딥러닝 2! 판교에서 진행중 <3☆12Aug 20, 2019Updated 6 years ago
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- MegaRAG: Multimodal Graph-based RAG☆33Sep 16, 2025Updated 5 months ago
- ☆11Aug 27, 2020Updated 5 years ago
- This is some of my Python technical books collection☆13Sep 26, 2013Updated 12 years ago
- ☆18Jun 14, 2025Updated 8 months ago
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval.☆12May 17, 2025Updated 8 months ago
- ☆13Apr 2, 2024Updated last year
- ☆13Feb 14, 2024Updated 2 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆28Jan 18, 2026Updated 3 weeks ago
- conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown☆48Jul 23, 2024Updated last year
- Data Set Description Language Specification (新一代人工智能数据集描 述语言DSDL)☆47May 29, 2024Updated last year
- 基于MP-CNN的中文句子相似度计算☆13Jun 26, 2018Updated 7 years ago
- 同花顺算法挑战平台:【9-10双月赛】跨领域迁移的文本语义匹配☆11Oct 28, 2021Updated 4 years ago
- Chatbot_CN项目的知识图谱模块☆12Mar 27, 2020Updated 5 years ago
- ☆21Jul 24, 2025Updated 6 months ago
- ☆12Jan 9, 2024Updated 2 years ago