KingDandanr/Qwen2-VL-LaTex_OCR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KingDandanr/Qwen2-VL-LaTex_OCR)

KingDandanr / Qwen2-VL-LaTex_OCR

以Qwen2-VL作为基座多模态大模型，通过指令微调的方式实现特定场景下的OCR，用于学习多模态LLM微调

☆25

Alternatives and similar repositories for Qwen2-VL-LaTex_OCR

Users that are interested in Qwen2-VL-LaTex_OCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Wxysnx / ai-memory-system
View on GitHub
A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…
☆11Apr 23, 2025Updated last year
AVC2-UESTC / Frequency-Inspired-Optimization-for-EfficientSR
View on GitHub
[TPAMI 2025] Implementation of "Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution"
☆17Mar 27, 2025Updated last year
charmber / CTF
View on GitHub
CTF信息安全竞赛平台后端服务
☆17Jun 9, 2023Updated 3 years ago
charmber / Cppserve
View on GitHub
C++Web服务框架----使用C++标准库进行开发的web服务框架，支持高并发流量，多种请求格式解析等
☆17Mar 8, 2024Updated 2 years ago
zhangzg1 / rag_with_chat
View on GitHub
基于RAG的知识问答系统，主要结合了 LLM、Langchain、提示工程、优化知识库结构和检索生成流程、vllm 推理优化框架等技术
☆25Mar 12, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jiangnanboy / intent_classification
View on GitHub
深度网络实现意图分类。
☆11Feb 26, 2021Updated 5 years ago
CandleLabAI / TPFNet
View on GitHub
☆11Dec 26, 2022Updated 3 years ago
di37 / chatbot-chatgpt-api
View on GitHub
Chatbot implementation using ChatGPT API and Gradio.
☆14Mar 2, 2023Updated 3 years ago
lcy0604 / CTRNet-plus
View on GitHub
The official implement of CTRNet++.
☆15Dec 30, 2024Updated last year
Shubhendu-Jena / Sparfels
View on GitHub
[ICCV'25] Sparfels: Fast Reconstruction from Sparse Unposed Imagery
☆16Jan 18, 2026Updated 6 months ago
wushilian / CenterNet2CharDet
View on GitHub
☆25Apr 16, 2021Updated 5 years ago
1556761383 / ksjsb-2
View on GitHub
快手极速版
☆14May 18, 2022Updated 4 years ago
lk-aa / LLM-Agent-HandsOn
View on GitHub
本仓库旨在记录和分享我在 LLM 和 Agent 领域的学习历程，并通过实践项目深入理解相关技术。通过从零开始构建基于 LLM 和 Agent 的应用，学习LLM原理和Agent开发经验。
☆26Mar 28, 2025Updated last year
Helen-Cheung / Baidu-AI-Challenge-Scene-Text-Removal
View on GitHub
☆15Feb 28, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
duxiangcheng / SAEN
View on GitHub
Modeling Stroke Mask for End-to-End Text Erasing
☆20Feb 9, 2023Updated 3 years ago
apache / doris-streamloader
View on GitHub
Stream Loader for Apache Doris
☆35May 5, 2026Updated 2 months ago
SCUT-DLVCLab / SCUT-EnsExam
View on GitHub
SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…
☆21Updated this week
wanjinchang / SSH-TensorFlow
View on GitHub
This is a TensorFlow implementation of SSH: Single Stage Headless Face Detector
☆32Aug 11, 2019Updated 6 years ago
HarendraKumarSingh / form-extractor-ocr
View on GitHub
Template based form extractor OCR. Train your own character and alphabet OCR.
☆18Oct 22, 2018Updated 7 years ago
ndjasnd / interview-ai
View on GitHub
给有一点python基础的中国宝宝准备的面试辅助工具，完全免费，看不惯这些面试ai的嘴脸（太贵了）！！！
☆25Oct 7, 2025Updated 9 months ago
ailab26 / pfld-lite
View on GitHub
A re-implementation of PFLD, https://arxiv.org/abs/1902.10859
☆45Aug 27, 2019Updated 6 years ago
DuGuQiuBai / Android-Notes
View on GitHub
Android快速入门笔记（11天掌握独孤九剑要领）
☆27Nov 27, 2015Updated 10 years ago
RisabBiswas / T2T-BinFormer
View on GitHub
SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network
☆24Dec 9, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PlusLabNLP / TempGen
View on GitHub
Code for Document-level Entity-based Extraction as Template Generation (EMNLP 2021)
☆29Sep 23, 2021Updated 4 years ago
Ikaros-521 / python_opencv_LPR
View on GitHub
基于python-opencv的车牌识别demo（参考：https://blog.csdn.net/weixin_41695564/article/details/79712393进行了修改）
☆21Nov 25, 2021Updated 4 years ago
RainerSeventeen / MultiRAG-Doc
View on GitHub
☆28Mar 30, 2026Updated 3 months ago
datawhalechina / easy-nlp
View on GitHub
最基本最小白的自然语言处理入门读物，基于deepseek-r1，涵盖了传统NLP和现代大模型
☆30Jan 16, 2026Updated 6 months ago
Osierddr / EAP-GS
View on GitHub
Official repository for the paper "EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstructio…
☆33Jun 15, 2025Updated last year
xiaqunfeng / face-evaluation
View on GitHub
Face evaluation method, such as FDDB, WIDERFace, Megaface, etc.
☆48Apr 24, 2018Updated 8 years ago
AI-Study-Han / Zero-Qwen-VL
View on GitHub
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
☆82Sep 6, 2024Updated last year
JizhiXiang / video-for-GPT2-chitchat
View on GitHub
Video explanation for GPT2-chitchat in detail / 中文闲聊的GPT2模型(GPT2-chitchat)代码视频详解
☆27Jul 6, 2023Updated 3 years ago
tanguymagne / UVDoc-Dataset
View on GitHub
Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation
☆35May 27, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gzhuuser / fortune_teller
View on GitHub
本项目将基于多模态,RAG以及LLM等技术，打造了一个基于手相算命的系统
☆30Aug 28, 2024Updated last year
GarrickLin / any_gateway
View on GitHub
A light-weight self-hosted AI API gateway that proxies requests to multiple backend providers (OpenAI, Anthropic, Gemini) with user manag…
☆26Updated this week
zyaocoder / CrackNex
View on GitHub
Official code of ICRA 2024 paper: CrackNex: a Few-shot Low-light Crack Segmentation Model Based on Retinex Theory for UAV Inspections
☆35Feb 16, 2026Updated 5 months ago
Nieson / KBQA-on-Bert
View on GitHub
基于Bert的智能问答系统！
☆29Feb 25, 2020Updated 6 years ago
guoguolord / CrackDataset
View on GitHub
☆36Apr 11, 2021Updated 5 years ago
psgetit / Chinese_Text_Classification_Pytorch
View on GitHub
中文：方便好用的文本分类模型训练加推理全公开！欢迎star后礼貌获取！大体上本项目采用ERINE3.0的base版本将文本转换为语义向量而后做特征进行分类，实测上限极高可以优化后在61分类任务中达到92%准确率。
☆49Mar 20, 2024Updated 2 years ago
TRT2022 / ControlNet_TensorRT
View on GitHub
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛初赛第三名方案
☆50Aug 16, 2023Updated 2 years ago