JSON Schema format for storing dataset details, processed document contents, and document annotations in the document understanding domain.
☆13 · Nov 5, 2024 · Updated last year
Alternatives and similar repositories for du-schema
Users who are interested in du-schema are comparing it to the libraries listed below:
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark." ☆36 · Mar 2, 2023 · Updated 2 years ago
- A fast and highly accurate differentiable Top-k operator from the "Successive Halving Top-k Operator" AAAI'21 paper. ☆16 · Jun 1, 2021 · Updated 4 years ago
- The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, wh… ☆19 · Oct 28, 2024 · Updated last year
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi… ☆14 · May 15, 2022 · Updated 3 years ago
- ☆40 · Aug 18, 2021 · Updated 4 years ago
- baselines for DocVQA dataset ☆21 · Apr 11, 2021 · Updated 4 years ago
- ☆59 · Aug 18, 2021 · Updated 4 years ago
- ☆39 · Feb 7, 2025 · Updated last year
- Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competi… ☆32 · Nov 12, 2020 · Updated 5 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning ☆13 · Aug 17, 2023 · Updated 2 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs ☆11 · Jan 13, 2023 · Updated 3 years ago
- Fake NEWS detector using LIAR dataset. ☆11 · Aug 19, 2019 · Updated 6 years ago
- ☆10 · Jul 6, 2023 · Updated 2 years ago
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory. ☆11 · Feb 21, 2026 · Updated last week
- Wikimedia Enterprise - client SDK in Python ☆20 · Nov 11, 2025 · Updated 3 months ago
- Containerfile for the Vanilla OS Desktop+Nvidia image. ☆16 · Feb 5, 2026 · Updated 3 weeks ago
- Security research organization dedicated to finding low hanging, critical, vulnerabilities. ☆15 · May 12, 2022 · Updated 3 years ago
- Code and data for the Walert large language model-based chatbot ☆12 · Aug 14, 2025 · Updated 6 months ago
- Simulated user for TREC 2016-2017 Dynamic Domain track ☆10 · Dec 27, 2017 · Updated 8 years ago
- Temporal and Causal Reasoning (dataset) ☆10 · Apr 19, 2022 · Updated 3 years ago
- How to backdoor Diffie-Hellman, lessons learned from the Socat non-prime prime ☆11 · Jun 29, 2021 · Updated 4 years ago
- Evaluation of GPT-3 for clinical information extraction tasks. ☆11 · Dec 13, 2022 · Updated 3 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Feb 6, 2023 · Updated 3 years ago
- Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding” ☆10 · Apr 3, 2022 · Updated 3 years ago
- Publicly released code for the LAMBERT model ☆105 · Jun 14, 2021 · Updated 4 years ago
- Blazing fast signature detection ☆11 · Sep 5, 2022 · Updated 3 years ago
- prevent XSS attacks by sanitizing html (this is different than escaping!) ☆22 · Oct 14, 2023 · Updated 2 years ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper. ☆12 · Nov 4, 2021 · Updated 4 years ago
- Rank-Biased Precision, Overlap, Recall, and Alignment ☆12 · Feb 18, 2025 · Updated last year
- R library for common information retrieval metrics ☆14 · Jun 5, 2023 · Updated 2 years ago
- ☆13 · Jun 20, 2024 · Updated last year
- Fair Benchmarks ☆10 · Mar 14, 2019 · Updated 6 years ago
- Python wrapper around Yossi Rubner's Earth Mover's Distance implementation (http://ai.stanford.edu/~rubner/emd/default.htm) ☆22 · Jul 9, 2015 · Updated 10 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea… ☆13 · Jan 30, 2020 · Updated 6 years ago
- QALD-9-Plus Dataset for Knowledge Graph Question Answering ☆12 · Aug 31, 2022 · Updated 3 years ago
- A UI designer for constructing AI applications with OpenSearch ☆16 · Updated this week
- Supervised and unsupervised Concept-based explanation of pretrained music classifiers ☆12 · Jul 27, 2023 · Updated 2 years ago
- TREC Core track ☆11 · Jul 5, 2017 · Updated 8 years ago
- Converting the Enron email collection to mbox format ☆11 · Dec 9, 2016 · Updated 9 years ago