allenai / s2-folks
Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.
☆222 Updated 2 months ago
Alternatives and similar repositories for s2-folks:
Users who are interested in s2-folks are comparing it to the repositories listed below:
- Python PDF parser for scientific publications: content and figures ☆402 Updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON) ☆363 Updated last year
- Get answers to research questions from 200M+ papers. ☆206 Updated last year
- Incorporating distribution of experts in order to better predict the future discovery of novel scientific connections ☆30 Updated last year
- ☆87 Updated 11 months ago
- Python client for GROBID Web services ☆321 Updated last month
- Unofficial Python client library for Semantic Scholar APIs. ☆365 Updated 2 months ago
- SciRepEval benchmark training and evaluation scripts ☆73 Updated 11 months ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆71 Updated last week
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty ☆77 Updated last year
- LitLLM: A Toolkit for Scientific Literature Review ☆57 Updated last year
- A Python library for OpenAlex (openalex.org) ☆227 Updated last week
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv. ☆342 Updated last week
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆60 Updated 11 months ago
- A collection of Jupyter notebooks, each walking you through a common example of bibliometric analysis using scholarly data from the OpenA… ☆114 Updated 11 months ago
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network ☆287 Updated 6 months ago
- Pretraining Efficiently on S2ORC! ☆160 Updated 5 months ago
- Code/data for MARG (multi-agent review generation) ☆42 Updated 5 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆78 Updated last week
- Aligned, Review-Informed Edits of Scientific Papers ☆50 Updated last year
- Papers about scientific hypothesis generation with large language models (LLMs). ☆62 Updated last month
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t… ☆61 Updated 3 weeks ago
- Discovering Data-driven Hypotheses in the Wild ☆71 Updated 5 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆92 Updated 8 months ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts. ☆43 Updated 5 months ago
- Benchmark baseline for retrieval qa applications ☆108 Updated last year
- Code for MedCPT, a model for zero-shot biomedical information retrieval. ☆173 Updated last year
- Science of Science ☆174 Updated last month
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance. ☆49 Updated last year
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆143 Updated last month