β16Dec 10, 2023Updated 2 years ago
Alternatives and similar repositories for DevNet
Users that are interested in DevNet are comparing it to the libraries listed below
Sorting:
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from π€Diffusers.β17May 18, 2023Updated 2 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking fβ¦β20Dec 4, 2024Updated last year
- β24Feb 26, 2023Updated 3 years ago
- β22May 30, 2023Updated 2 years ago
- β17Jul 9, 2024Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)β67Jun 6, 2024Updated last year
- [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoratiβ¦β55Dec 22, 2025Updated 2 months ago
- β17Jul 24, 2025Updated 7 months ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformerβ78Apr 9, 2024Updated last year
- β23Oct 10, 2024Updated last year
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)β24May 26, 2025Updated 9 months ago
- A simple 2D ball collision engine.β12Jun 15, 2023Updated 2 years ago
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documentsβ107Jul 15, 2025Updated 8 months ago
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoningβ73Dec 17, 2025Updated 3 months ago
- β19Sep 11, 2024Updated last year
- Real-CE: A Benchmark for Chinese-English Scene Text Image Super-resolution (ICCV2023)β96Nov 3, 2023Updated 2 years ago
- β39Jun 19, 2023Updated 2 years ago
- β27Mar 7, 2025Updated last year
- A modified version of the cart-pole OpenAI Gym environment for testing different control policiesβ13Jul 15, 2024Updated last year
- β21Jun 16, 2021Updated 4 years ago
- Model-Agnostic Meta-Learning in PyTorchβ11Jul 31, 2020Updated 5 years ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20β¦β62Jul 4, 2024Updated last year
- A toolbox for EEG signals processing. Welcome to join and build!β13Nov 9, 2022Updated 3 years ago
- Training a transformer to generate cursive handwritingβ33Apr 2, 2025Updated 11 months ago
- Modeling Stroke Mask for End-to-End Text Erasingβ19Feb 9, 2023Updated 3 years ago
- β25May 31, 2024Updated last year
- β14May 4, 2024Updated last year
- This repo is the official implementation of DeepCalliFont: Few-shot Chinese Calligraphy Font Synthesis by Integrating Dual-modality Generβ¦β31May 11, 2024Updated last year
- Another ChatGLM2 implementation for GPTQ quantizationβ55Oct 15, 2023Updated 2 years ago
- -β24Oct 25, 2022Updated 3 years ago
- Beer Game implemented as an OpenAI gym environment.β17Aug 4, 2019Updated 6 years ago
- Python implementation of a Minimal Active Inference Agentβ17Feb 9, 2023Updated 3 years ago
- A Survey of Multimodal Retrieval-Augmented Generationβ20Nov 3, 2025Updated 4 months ago
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?β35Dec 1, 2025Updated 3 months ago
- β67Apr 18, 2024Updated last year
- Retrieval-Augmented Decision Transformer: External Memory for In-context RLβ24Oct 27, 2024Updated last year
- The tampered text detection datasetβ22Aug 23, 2023Updated 2 years ago
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Frameworkβ14May 31, 2023Updated 2 years ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environmentsβ48Jan 8, 2026Updated 2 months ago