MurtuzaBohra / SimpDOMLinks
Simplified DOM Trees for Transferable Attribute Extraction from the Web
☆40Updated last year
Alternatives and similar repositories for SimpDOM
Users that are interested in SimpDOM are comparing it to the libraries listed below
Sorting:
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆49Updated 3 years ago
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆60Updated 2 years ago
- Language models are open knowledge graphs ( non official implementation )☆170Updated 5 years ago
- The unified platform for data-related resources.☆134Updated 2 years ago
- The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. …☆150Updated 2 years ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆106Updated last year
- ☆86Updated 7 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆68Updated 3 months ago
- ☆80Updated last year
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆134Updated 3 months ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆104Updated 2 years ago
- A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!☆93Updated 8 months ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking☆72Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆128Updated 3 years ago
- A Framework for Textual Entailment based Zero Shot text classification☆153Updated last year
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.☆73Updated 3 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 3 years ago
- ☆13Updated 3 years ago
- Open source library for few shot NLP☆78Updated 2 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated 2 years ago
- ☆184Updated 2 years ago
- Code for experiments on OpenBookQA from the EMNLP 2018 paper "Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Quest…☆130Updated 4 years ago
- Source for the ACL 2021 Findings paper "Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models"☆50Updated 2 years ago
- SQuARE: Software for question answering research.☆75Updated last year
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆294Updated 4 years ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆22Updated 3 years ago
- [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction☆310Updated 2 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆41Updated 4 years ago