shauryr / S2QA
Get answers to research questions from 200M+ papers. Link to demo -
☆203Updated 8 months ago
Related projects: ⓘ
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆332Updated 5 months ago
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆257Updated 11 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.☆166Updated last week
- Python PDF parser for scientific publications: content and figures☆328Updated 5 months ago
- Unofficial Python client library for Semantic Scholar APIs.☆287Updated 2 months ago
- ☆81Updated 3 months ago
- ☆78Updated 4 months ago
- A proof of concept to scrape papers from journals☆227Updated 3 months ago
- Semantic search engine indexing 95 million academic publications☆76Updated last year
- SciRepEval benchmark training and evaluation scripts☆67Updated 4 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆106Updated 10 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆63Updated 5 months ago
- Tools to scrape publication metadata from pubmed, arxiv, medrxiv and chemrxiv.☆211Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 8 months ago
- ☆91Updated 5 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆124Updated 5 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆118Updated 6 months ago
- Medical reasoning using large language models☆83Updated 8 months ago
- Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"☆293Updated 8 months ago
- Completion After Prompt Probability. Make your LLM make a choice☆68Updated last week
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆362Updated 7 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆143Updated 2 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆70Updated last month
- An open science effort to benchmark legal reasoning in foundation models☆325Updated 3 weeks ago
- Python client for GROBID Web services☆279Updated 3 weeks ago
- Code for the paper: "Large Language Models as Corporate Lobbyists" (2023).☆168Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆201Updated 6 months ago
- ☆47Updated last year
- multimodal document analysis☆159Updated 3 months ago
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.☆107Updated last year