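KingDandanr / Qwen2-VL-LaTex_OCR
Uses Qwen2-VL as the base multimodal large model and applies instruction fine-tuning to perform OCR in a specific scenario; intended for learning how to fine-tune multimodal LLMs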
☆22Jan 18, 2025Updated last year
Alternatives and similar repositories for Qwen2-VL-LaTex_OCR
Users that are interested in Qwen2-VL-LaTex_OCR are comparing it to the libraries listed below
- 🏆 SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting☆18Feb 4, 2026Updated 2 weeks ago
- [ICCV25] MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions☆19Oct 14, 2025Updated 4 months ago
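- This project builds and optimizes from scratch a large-scale pretrained language model at the tens-of-millions-of-parameters level, covering three stages: pretraining, supervised fine-tuning (SFT), and R1 reasoning distillation. It uses a custom Transformer architecture (including RMSNorm, grouped attention, a multi-query mechanism, SwiGLU activation, and RoPE positional encoding) to achieve efficient long-text processing and…☆22Mar 10, 2025Updated 11 months ago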
- ☆11Dec 26, 2022Updated 3 years ago
- Chatbot implementation using ChatGPT API and Gradio.☆14Mar 2, 2023Updated 2 years ago
- [AAAI'26] SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction☆26Nov 19, 2025Updated 2 months ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆14Dec 21, 2023Updated 2 years ago
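- The interview assistant system is an AI-based tool that transcribes the interviewer's audio to text in real time and suggests suitable answers. Supports a knowledge-base option.☆23Mar 25, 2025Updated 10 months ago
- The most basic, beginner-friendly introductory reader for natural language processing, based on deepseek-r1, covering both traditional NLP and modern large models☆23Jan 16, 2026Updated last month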
- Packaged TResNet based on Official PyTorch Implementation☆15Oct 26, 2020Updated 5 years ago
- deploy onnx models with TensorRT and LibTorch☆19Nov 17, 2021Updated 4 years ago
- WebSocket usage examples (Python 3 and HTML source code)☆16Jan 14, 2020Updated 6 years ago
- BMInf demos.☆16Oct 14, 2021Updated 4 years ago
- Modeling Stroke Mask for End-to-End Text Erasing☆19Feb 9, 2023Updated 3 years ago
- Official code of ICRA 2024 paper: CrackNex: a Few-shot Low-light Crack Segmentation Model Based on Retinex Theory for UAV Inspections☆30May 23, 2025Updated 8 months ago
- ☆57Dec 23, 2025Updated last month
- License plate recognition demo based on python-opencv (modified from https://blog.csdn.net/weixin_41695564/article/details/79712393)☆21Nov 25, 2021Updated 4 years ago
- Solution for the 2021 Sohu Campus Text Matching Algorithm Competition☆18Nov 7, 2024Updated last year
- Project on the assignment of ICD codes to medical/clinical text☆21Jun 12, 2023Updated 2 years ago
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- Wavelet-based Image InPainting Model☆26Sep 15, 2024Updated last year
- Backbone network replaced with an improved ResNet50☆22Nov 4, 2023Updated 2 years ago
- Code from our paper "Template-guided Illumination Correction for Document Images with Imperfect Geometric Reconstruction" (ICCVW) 2023.☆27Feb 7, 2024Updated 2 years ago
- Automated check-ins for Sina Weibo, China Telecom mobile, Kuaishou Lite, iQIYI, and WPS / video farming on Kuaishou, coin farming on Toutiao and Baidu Lite☆26Mar 4, 2021Updated 4 years ago
- ☆25Apr 16, 2021Updated 4 years ago
- This project uses multimodal, RAG, and LLM techniques to build a palm-reading fortune-telling system☆30Aug 28, 2024Updated last year
- WeChat automatic reading assistant☆28Jan 12, 2025Updated last year
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation☆31May 27, 2024Updated last year
- This repository aims to reproduce R1-Zero in the medical domain.☆32Jun 11, 2025Updated 8 months ago
- ☆30Sep 2, 2019Updated 6 years ago
- FETNet: Feature Erasing and Transferring Network for Scene Text Removal☆35Jul 18, 2023Updated 2 years ago
- Package nett steals from the standard library's net package and provides a dialer with a pluggable host resolver.☆38Jan 17, 2015Updated 11 years ago
- Intelligent question-answering system based on BERT!☆30Feb 25, 2020Updated 5 years ago
- Manage 2G/3G/4G/5G modules through the web☆49Aug 15, 2025Updated 6 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆48Aug 26, 2024Updated last year
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆51Aug 28, 2025Updated 5 months ago
- ncnn demo of (document rectification) DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction☆44Jan 9, 2022Updated 4 years ago
- Example code for fine-tuning multimodal large language models with LLaMA-Factory☆56Sep 8, 2024Updated last year
- Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…☆58Feb 7, 2024Updated 2 years ago