以Qwen2-VL作为基座多模态大模型,通过指令微调的方式实现特定场景下的OCR,用于学习多模态LLM微调
☆25Jan 18, 2025Updated last year
Alternatives and similar repositories for Qwen2-VL-LaTex_OCR
Users that are interested in Qwen2-VL-LaTex_OCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🏆 SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting☆33Apr 10, 2026Updated 2 weeks ago
- deploy onnx models with TensorRT and LibTorch☆19Nov 17, 2021Updated 4 years ago
- ☆30Feb 27, 2025Updated last year
- [ICCV25] MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions☆20Feb 19, 2026Updated 2 months ago
- [AAAI' 26]SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction☆27Nov 19, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Dec 26, 2022Updated 3 years ago
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆27Jan 16, 2026Updated 3 months ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆13Dec 21, 2023Updated 2 years ago
- ☆22Mar 11, 2025Updated last year
- Packaged TResNet based on Official PyTorch Implementation☆15Oct 26, 2020Updated 5 years ago
- Chatbot implementation using ChatGPT API and Gradio.☆14Mar 2, 2023Updated 3 years ago
- ☆25Apr 16, 2021Updated 5 years ago
- ☆30Sep 2, 2019Updated 6 years ago
- Websocket 的使用示例(Python3和Html的源代码)☆16Jan 14, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official code of ICRA 2024 paper: CrackNex: a Few-shot Low-light Crack Segmentation Model Based on Retinex Theory for UAV Inspections☆33Feb 16, 2026Updated 2 months ago
- ☆15Feb 28, 2022Updated 4 years ago
- Modeling Stroke Mask for End-to-End Text Erasing☆19Feb 9, 2023Updated 3 years ago
- 对机器学习、概率图模型、主题模型领域一些模型进行实现,主要涉及一些近年高水平会议论文中提到的算法。☆19May 19, 2017Updated 8 years ago
- ☆57Dec 23, 2025Updated 4 months ago
- This is a TensorFlow implementation of SSH: Single Stage Headless Face Detector☆32Aug 11, 2019Updated 6 years ago
- koahub.js 简单的后台内容管理系统☆37Jan 17, 2018Updated 8 years ago
- A re-implementation of PFLD, https://arxiv.org/abs/1902.10859☆45Aug 27, 2019Updated 6 years ago
- Template based form extractor OCR. Train your own character and alphabet OCR.☆18Oct 22, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 2021 搜狐校园文本匹配算法大赛方案☆19Nov 7, 2024Updated last year
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- Code for Document-level Entity-based Extraction as Template Generation (EMNLP 2021)☆29Sep 23, 2021Updated 4 years ago
- 基于python-opencv的车牌识别demo(参考:https://blog.csdn.net/weixin_41695564/article/details/79712393进行了修改)☆21Nov 25, 2021Updated 4 years ago
- 对langchain-ChatGLM项目各模块进行注释,增加了一些新的特性,修复了一些bug☆25Nov 10, 2023Updated 2 years ago
- Wavelet-based Image InPainting Model☆27Sep 15, 2024Updated last year
- 竞争性自适应重加权采样法(competitive adapative reweighted sampling, CARS)python代码☆24Apr 20, 2022Updated 4 years ago
- Code from our paper "Template-guided Illumination Correction for Document Images with Imperfect Geometric Reconstruction " (ICCVW) 2023.☆28Feb 7, 2024Updated 2 years ago
- Project on the assignment of ICD codes to medical/clinical text☆22Jun 12, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 本项目将基于多模态,RAG以及LLM等技术,打造了一个基于手相算命的系统☆31Aug 28, 2024Updated last year
- 基于RNN的中文分词☆25Jun 30, 2017Updated 8 years ago
- Video explanation for GPT2-chitchat in detail / 中文闲聊的GPT2模型(GPT2-chitchat)代码视频详解☆27Jul 6, 2023Updated 2 years ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation☆32May 27, 2024Updated last year
- 网易云课堂 日月光华 Pytorch深度学习入门与实战 课程 配套课程代码☆42Aug 18, 2022Updated 3 years ago
- FETNet: Feature Erasing and Transferring Network for Scene Text Removal☆35Jul 18, 2023Updated 2 years ago
- 基于Bert的智能问答系统!☆30Feb 25, 2020Updated 6 years ago