allenai / s2-folks
Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.
☆226 · Updated 3 months ago
Alternatives and similar repositories for s2-folks:
Users who are interested in s2-folks are comparing it to the libraries listed below.
- ☆89 · Updated 11 months ago
- Unofficial Python client library for Semantic Scholar APIs. ☆370 · Updated 2 months ago
- SciRepEval benchmark training and evaluation scripts ☆74 · Updated 11 months ago
- Incorporating distribution of experts in order to better predict the future discovery of novel scientific connections ☆31 · Updated last year
- Python client for GROBID Web services ☆324 · Updated 2 months ago
- Python PDF parser for scientific publications: content and figures ☆404 · Updated last year
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network ☆288 · Updated 7 months ago
- Get answers to research questions from 200M+ papers. Link to demo - ☆206 · Updated last year
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar. ☆42 · Updated 6 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON) ☆394 · Updated last year
- Code/data for MARG (multi-agent review generation) ☆43 · Updated 5 months ago
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv. ☆350 · Updated last week
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty ☆79 · Updated last year
- LitLLM: A Toolkit for Scientific Literature Review ☆60 · Updated 3 weeks ago
- A Python library for OpenAlex (openalex.org) ☆234 · Updated last month
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆60 · Updated last year
- PDF parsing toolkit for preparing academic text corpus ☆56 · Updated 9 months ago
- Aligned, Review-Informed Edits of Scientific Papers ☆51 · Updated last year
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/ ☆922 · Updated last year
- Discovering Data-driven Hypotheses in the Wild ☆80 · Updated 5 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval. ☆175 · Updated last year
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆83 · Updated last week
- Science-parse version 2 ☆244 · Updated 5 years ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆71 · Updated 3 weeks ago
- To automate the SLR process and write paper quickly using multi agents of AI ☆41 · Updated last year
- ☆74 · Updated last year
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts. ☆44 · Updated 6 months ago
- ☆93 · Updated 11 months ago
- ☆38 · Updated 5 months ago
- Pretraining Efficiently on S2ORC! ☆163 · Updated 6 months ago