《人工智能原理》课程设计(基于Resnet-Transformer的手写数学表示式识别)
☆17Jan 10, 2023Updated 3 years ago
Alternatives and similar repositories for HMER
Users that are interested in HMER are comparing it to the libraries listed below
Sorting:
- ☆14Jun 10, 2025Updated 8 months ago
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆13Nov 15, 2022Updated 3 years ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆19Aug 23, 2025Updated 6 months ago
- FFmpeg移植☆12Jan 7, 2019Updated 7 years ago
- ☆13Sep 25, 2019Updated 6 years ago
- 神经网络大作业:公式识别,两种模型(CNN+RNN ResNet+Transformer)☆16May 14, 2022Updated 3 years ago
- 学习多媒体系列教程(从最基础的C语法->编写实用的C++播放器->前沿音视频处理技术)☆15Apr 21, 2019Updated 6 years ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Apr 24, 2024Updated last year
- Video-Recorder based on qt, ffmpeg and C++☆23Aug 22, 2021Updated 4 years ago
- ocr,pdf转docx,pdf to docx☆23Nov 4, 2022Updated 3 years ago
- 利用java-yolov8实现版面检测(Chinese layout detection),java-yolov8 is used to detect the layout of Chinese document images☆27May 5, 2023Updated 2 years ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆38Feb 1, 2026Updated last month
- GAKE: Graph Aware Knowledge Embedding(COLING2016)☆28Jan 19, 2019Updated 7 years ago
- Android开发-蓝牙(BlueTooth)设备检测连接的实现☆27May 21, 2018Updated 7 years ago
- [UbiComp/IMWUT '23] Hierarchical Clustering-based Personalized Federated Learning for Robust and Fair Human Activity Recognition☆32Oct 20, 2025Updated 4 months ago
- [IoTDI 2023/ML4IoT 2023] Async-HFL: Efficient and Robust Asynchronous Federated Learning in Hierarchical IoT Networks☆41Apr 4, 2023Updated 2 years ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Sep 9, 2024Updated last year
- ☆35May 23, 2021Updated 4 years ago
- FLIS: Clustered Federated Learning via Inference Similarity for Non-IID Data Distribution☆40Nov 13, 2022Updated 3 years ago
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆59Apr 28, 2023Updated 2 years ago
- ☆70Jun 26, 2024Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆93Oct 24, 2024Updated last year
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆79Sep 6, 2024Updated last year
- Papers related to federated learning in top conferences (2020-2024).☆69Oct 14, 2024Updated last year
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer☆84Apr 10, 2025Updated 10 months ago
- 基于baichuan-7b的开源多模态大语言模型☆72Dec 7, 2023Updated 2 years ago
- The code and data of We-Math 2.0.☆164Aug 30, 2025Updated 6 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆86Nov 10, 2024Updated last year
- ☆109Sep 24, 2025Updated 5 months ago
- Qianfan-VL: Domain-Enhanced Universal Vision-Language Models☆181Sep 22, 2025Updated 5 months ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆120Aug 27, 2024Updated last year
- OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正☆93Nov 27, 2020Updated 5 years ago
- ☆101Dec 22, 2023Updated 2 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆130Nov 6, 2024Updated last year
- GLM Series Edge Models☆157Jun 12, 2025Updated 8 months ago
- 📝 A collection of common datasets used in knowledge embedding☆156Mar 22, 2020Updated 5 years ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆149Sep 10, 2024Updated last year
- [CVPR2023] Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution☆189Feb 4, 2026Updated last month