due-benchmark/du-schema

due-benchmark / du-schema

JSON Schema format for storing dataset details, processed document contents, and document annotations in the document understanding domain.

☆14

Alternatives and similar repositories for du-schema

Users who are interested in du-schema are comparing it to the repositories listed below.

due-benchmark / baselines
The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."
☆36 · Mar 2, 2023 · Updated 3 years ago
applicaai / successive-halving-topk
A fast and highly accurate differentiable Top-k operator from the "Successive Halving Top-k Operator" AAAI'21 paper.
☆16 · Jun 1, 2021 · Updated 5 years ago
applicaai / pyramidions
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14 · May 15, 2022 · Updated 4 years ago
FeatEng / FeatEng
The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, wh…
☆22 · Oct 28, 2024 · Updated last year
applicaai / kleister-charity
☆40 · Aug 18, 2021 · Updated 4 years ago
applicaai / contract-discovery
Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competi…
☆32 · Nov 12, 2020 · Updated 5 years ago
applicaai / kleister-nda
☆61 · Aug 18, 2021 · Updated 4 years ago
applicaai / lambert
Publicly released code for the LAMBERT model
☆106 · Jun 14, 2021 · Updated 5 years ago
mineshmathew / DocVQA
baselines for DocVQA dataset
☆21 · Apr 11, 2021 · Updated 5 years ago
google-research-datasets / QuoteSum
QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …
☆13 · Mar 25, 2024 · Updated 2 years ago
uma-pi1 / kgt5-context
☆12 · Jun 20, 2024 · Updated 2 years ago
ModelOriented / MAIR
Monitoring of AI Regulations
☆19 · May 30, 2021 · Updated 5 years ago
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 · Aug 20, 2022 · Updated 3 years ago
ronghanghu / vqa-maskrcnn-benchmark-m4c
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13 · Jan 30, 2020 · Updated 6 years ago
neo4j-graph-examples / twitch
Twitch Streamer Analysis, see Twitchverse https://towardsdatascience.com/twitchverse-a-network-analysis-of-twitch-universe-using-neo4j-gr…
☆18 · Oct 25, 2024 · Updated last year
nuwandavek / justplotme
☆11 · Mar 31, 2024 · Updated 2 years ago
applicaai / CCpdf
Index of URLs to pdf files all over the internet and scripts
☆25 · May 2, 2023 · Updated 3 years ago
monterail / elasticsearch-analysis-morfologik
Morfologik (Polish) Analysis Plugin for ElasticSearch
☆24 · Oct 5, 2015 · Updated 10 years ago
datatagsuite / schema
DATS JSON schemas
☆13 · Dec 21, 2022 · Updated 3 years ago
expertailab / ISAAQ
☆10 · Oct 1, 2020 · Updated 5 years ago
adlnlp / form_nlu
☆19 · Nov 1, 2024 · Updated last year
PaulDance / cargo-liner
Cargo subcommand to install and update binary packages listed in configuration
☆19 · Jul 14, 2026 · Updated last week
chunchiehy / musst
Multi-span Style Extraction for Generative Reading Comprehension
☆10 · Apr 2, 2021 · Updated 5 years ago
AIM3-RUC / MPMQA
Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)
☆21 · Nov 28, 2022 · Updated 3 years ago
mnamysl / nat-acl2020
☆15 · May 26, 2021 · Updated 5 years ago
angelhof / flumina
A parallel programming model for online applications with complex synchronization requirements.
☆16 · Jun 8, 2022 · Updated 4 years ago
cl-tohoku / AIO2_DPR_baseline
https://www.nlp.ecei.tohoku.ac.jp/projects/aio/
☆16 · Aug 4, 2022 · Updated 3 years ago
cvzoya / visuallydata
A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations
☆16 · Oct 8, 2018 · Updated 7 years ago
decompositional-semantics-initiative / improved-ParaBank-rewriter
Improved ParaBank Rewriter
☆22 · Jan 22, 2020 · Updated 6 years ago
nikhilanand03 / paper-video
Turn research into video
☆22 · Apr 15, 2026 · Updated 3 months ago
bzhangGo / st_from_scratch
Revisiting End-to-End Speech-to-Text Translation From Scratch
☆13 · Feb 21, 2023 · Updated 3 years ago
Phantomical / perf-event
perf-event: a Rust interface to Linux performance monitoring
☆23 · Aug 14, 2025 · Updated 11 months ago
sellerskyle / audio-effect-suite
☆10 · Jul 22, 2021 · Updated 4 years ago
oriyor / turning_tables
Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…
☆22 · Nov 2, 2021 · Updated 4 years ago
bigwater / gpunfa-artifact
☆19 · Nov 21, 2022 · Updated 3 years ago
CPJKU / composer_concept
Supervised and unsupervised Concept-based explanation of pretrained music classifiers
☆12 · Jul 27, 2023 · Updated 2 years ago
applicaai / poleval-2018
Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."
☆27 · Dec 8, 2022 · Updated 3 years ago
salesforce / QVR-SimpleDLM
Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.
☆16 · May 1, 2025 · Updated last year
AwalkZY / CPN
Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”
☆10 · Apr 3, 2022 · Updated 4 years ago