allenai / pdf-component-library
☆80 · Updated last year
Alternatives and similar repositories for pdf-component-library
Users that are interested in pdf-component-library are comparing it to the libraries listed below
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON) ☆440 · Updated last year
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar. ☆43 · Updated 11 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions. ☆242 · Updated 8 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆100 · Updated 5 months ago
- library supporting NLP and CV research on scientific papers ☆785 · Updated 11 months ago
- PDF parser powered by grobid ☆28 · Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆214 · Updated 8 months ago
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l… ☆132 · Updated 4 months ago
- Get answers to research questions from 200M+ papers. ☆206 · Updated last year
- Viewer for the structure extracted by Grobid on PDF documents ☆54 · Updated 3 weeks ago
- Python client for GROBID Web services ☆364 · Updated last week
- ☆102 · Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆52 · Updated 6 months ago
- ☆88 · Updated 10 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆63 · Updated last year
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network ☆295 · Updated last year
- ☆37 · Updated last year
- multimodal document analysis ☆167 · Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆440 · Updated last year
- 📄 ⚙️ ETL processes for medical and scientific papers ☆398 · Updated 2 months ago
- ☆98 · Updated last year
- ☆193 · Updated this week
- SciRepEval benchmark training and evaluation scripts ☆76 · Updated last year
- 🤖🌊 aiFlows: The building blocks of your collaborative AI ☆272 · Updated last year
- Unofficial Python client library for Semantic Scholar APIs. ☆407 · Updated 3 weeks ago
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv. ☆161 · Updated 5 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆189 · Updated 4 months ago
- ☆43 · Updated 2 months ago
- Python PDF parser for scientific publications: content and figures ☆434 · Updated last year
- Social and customizable AI writing assistant! ✍️ ☆253 · Updated 3 months ago