oasisresearchlab / context24
Dataset repository for SDPROC SHared Task: Context24: Contextualizing Scientific Figures and Tables
☆18Updated 3 months ago
Related projects: ⓘ
- ☆34Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆63Updated last year
- ☆31Updated 8 months ago
- Dataset accompanying the SPECTER model☆127Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆54Updated 4 months ago
- MultiCite code and data. Models are available on Huggingface.☆28Updated 2 years ago
- SciRepEval benchmark training and evaluation scripts☆67Updated 4 months ago
- Generating claims for zero-shot scientific fact checking☆29Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆50Updated last year
- ☆18Updated last year
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆18Updated 8 months ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆16Updated 2 months ago
- ☆41Updated last month
- Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles☆42Updated 3 months ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆42Updated last week
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆17Updated last year
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆13Updated 2 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆29Updated 2 years ago
- This is the official repository for "CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data An…☆14Updated 10 months ago
- Code and model checkpoints for the MultiVerS model for scientific claim verification.☆44Updated last year
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆16Updated 2 months ago
- Poor man's simple harvester for arXiv resources☆11Updated last year
- Frame Semantic Parser based on T5 and FrameNet☆51Updated last year
- ☆78Updated 4 months ago
- A Large Semantic Knowledge Graph from Wikipedia Categories and Listings☆24Updated last year
- ☆19Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆25Updated last month
- Collection of public APIs for embedding scientific papers☆53Updated 3 years ago
- Scripts used to make and evaluate OpenAlex's concept tagging model☆48Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022☆27Updated last year