allenai / pdf-component-library
☆88 Updated last year
Alternatives and similar repositories for pdf-component-library
Users who are interested in pdf-component-library are comparing it to the libraries listed below.
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar. ☆46 Updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON) ☆454 Updated last year
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions. ☆258 Updated last year
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆107 Updated 9 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆220 Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆62 Updated last year
- ☆116 Updated 3 months ago
- Unofficial Python client library for Semantic Scholar APIs. ☆430 Updated last month
- PDF parser powered by grobid ☆28 Updated last year
- A spaCy wrapper for GliNER ☆129 Updated last year
- library supporting NLP and CV research on scientific papers ☆788 Updated last year
- Get answers to research questions from 200M+ papers. ☆207 Updated 3 months ago
- ☆108 Updated 3 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆319 Updated last year
- Benchmarking PDF libraries ☆321 Updated 7 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆224 Updated 5 months ago
- Python client for GROBID Web services ☆387 Updated 3 weeks ago
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library ☆253 Updated last week
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆28 Updated 3 years ago
- 📄 ⚙️ ETL processes for medical and scientific papers ☆642 Updated last month
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆446 Updated last year
- ☆201 Updated last week
- Semantic search engine indexing 110 million academic publications ☆99 Updated 2 weeks ago
- ☆39 Updated 2 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆53 Updated 10 months ago
- A dataset for pretraining language models targeted for legal tasks. ☆141 Updated 3 years ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs ☆297 Updated 2 weeks ago
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications. ☆115 Updated 6 months ago
- Awesome deliberative prompting: How to ask LLMs to produce reliable reasoning and make reason-responsive decisions. ☆120 Updated last year
- Dataset and annotations for ASSETS 2022 publication ☆12 Updated 3 years ago