☆37Jan 26, 2026Updated 3 months ago
Alternatives and similar repositories for texannotate
Users that are interested in texannotate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.☆41Dec 7, 2023Updated 2 years ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- ☆40Apr 6, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Feb 20, 2024Updated 2 years ago
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- pytorch-TripletSemiHardLoss☆10Jan 12, 2022Updated 4 years ago
- The source code for paper--MORE: A Metric learning based framework for Open-domain Relation Extraction.☆12Jan 15, 2021Updated 5 years ago
- Simple ChatGPT interface for shell and macOS Alfred workflow☆13Oct 3, 2025Updated 7 months ago
- Automated Semantic Analysis of Discourse Markers☆11May 30, 2022Updated 3 years ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Jun 11, 2024Updated last year
- ☆78Aug 7, 2023Updated 2 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- Auto updater for portable application.☆13Apr 24, 2026Updated 3 weeks ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆320Aug 15, 2025Updated 9 months ago
- Pytorch implements SA-Text: Simple but Accurate Detector for Text of Arbitrary Shapes☆42Jun 25, 2020Updated 5 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- Avalonia SkiaSharp Fiddle is a SkiaSharp playground created with Avalonia and running on macOS, Linux, Windows and WebAssembly.☆13Mar 7, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Demo app for CodeEditorView☆35Jun 8, 2025Updated 11 months ago
- ☆13Oct 16, 2020Updated 5 years ago
- Python验证码生成工具☆11Mar 5, 2022Updated 4 years ago
- ☆18Jun 12, 2024Updated last year
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago
- [ICASSP'23] PAGE: A Position-Aware Graph-based model for Emotion cause entailment☆16Jun 1, 2023Updated 2 years ago
- A .NET library for integrating virtualising and paging data for UIs☆17Oct 7, 2025Updated 7 months ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ICC Profiles☆11Aug 30, 2018Updated 7 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆44Jan 18, 2024Updated 2 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- ☆19Oct 10, 2020Updated 5 years ago
- XETBook, a free version of Bembo☆16Apr 25, 2026Updated 3 weeks ago
- Unofficial PyTorch implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks☆191May 12, 2020Updated 6 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Apr 15, 2019Updated 7 years ago