MurtuzaBohra / SimpDOM
Simplified DOM Trees for Transferable Attribute Extraction from the Web
☆38Updated 4 months ago
Alternatives and similar repositories for SimpDOM:
Users that are interested in SimpDOM are comparing it to the libraries listed below
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆47Updated 2 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆124Updated 3 years ago
- The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. …☆139Updated 2 years ago
- ☆84Updated 5 months ago
- ☆12Updated 2 years ago
- Unofficial Pytorch implementation of Dom-LM paper.☆33Updated last year
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to any target domain.☆59Updated last year
- X-BERT: eXtreme Multi-label Text Classification with BERT☆52Updated 5 years ago
- A simple example for finetuning HuggingFace T5 model. Includes code for intermediate generation.☆27Updated 4 years ago
- [EMNLP 2021] The baseline code for WebSRC dataset.☆49Updated 2 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 2 years ago
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆38Updated 3 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆132Updated 8 months ago
- A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!☆92Updated 2 years ago
- ☆57Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 9 months ago
- pytorch implementation of the TwinBert paper☆39Updated 3 years ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking☆67Updated 2 years ago
- Dense hybrid representations for text retrieval☆62Updated last year
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆72Updated 3 years ago
- ☆13Updated 4 years ago
- code and data to faciliate BERT/ELECTRA for document ranking. Details refer to the paper - PARADE: Passage Representation Aggregation for…☆97Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆75Updated 2 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.☆56Updated 3 years ago
- Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generat…☆15Updated 2 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆28Updated 2 years ago
- Implement Retrospective Reader for Machine Reading Comprehension with 🤗 transformers and datasets☆19Updated 2 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆68Updated last year
- Unified Learned Sparse Retrieval Framework☆63Updated 9 months ago