deepcpcfg / datasets
Supplementary materials for DeepCPCFG
☆23Updated 4 years ago
Alternatives and similar repositories for datasets
Users who are interested in datasets are comparing it to the repositories listed below
- Experimental form data extraction for journalism☆77Updated 4 years ago
- Automatically labeling training data☆107Updated 6 years ago
- Machine Learning for Information Retrieval☆86Updated 2 months ago
- Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper☆92Updated 3 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated 10 months ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- Self-training with Weak Supervision (NAACL 2021)☆160Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆203Updated 2 years ago
- A collection of simple tutorials for using Fonduer☆100Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆68Updated 3 years ago
- Model for learning document embeddings along with their uncertainties☆35Updated last year
- ☆58Updated 3 years ago
- How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.☆134Updated 3 years ago
- Code for obtaining the Curation Corpus abstractive text summarisation dataset☆128Updated 4 years ago
- This is a helper for PyTorch-BigGraph☆22Updated 5 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Implementation of SiameseXML (ICML 2021)☆40Updated 2 years ago
- Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/☆246Updated last year
- An extensible framework for building visualization and annotation tools to enable better interaction with NLP and Artificial Intelligence…☆49Updated 2 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆157Updated 2 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆114Updated 5 years ago
- ☆9Updated 3 years ago
- A deep learning framework for building multimodal multi-task learning systems.☆111Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆63Updated 11 months ago
- A repository with anonymized invoices☆12Updated 6 years ago
- This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peete…☆38Updated 2 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆233Updated 2 years ago
- Multitask Learning with Pretrained Transformers☆40Updated 4 years ago