allenai / pdf-component-libraryLinks
☆77Updated last year
Alternatives and similar repositories for pdf-component-library
Users that are interested in pdf-component-library are comparing it to the libraries listed below
Sorting:
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆98Updated 3 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆427Updated last year
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar.☆42Updated 8 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.☆236Updated 6 months ago
- 📄 ⚙️ ETL processes for medical and scientific papers☆394Updated 3 weeks ago
- ☆81Updated 8 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆209Updated 6 months ago
- multimodal document analysis☆165Updated last year
- SciRepEval benchmark training and evaluation scripts☆75Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62Updated last year
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…☆118Updated 2 months ago
- library supporting NLP and CV research on scientific papers☆778Updated 8 months ago
- ☆93Updated last year
- A spaCy wrapper for GliNER☆118Updated 6 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆52Updated 3 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆434Updated last year
- Semantic search engine indexing 110 million academic publications☆91Updated 3 weeks ago
- QualiGPT: An easy-to-use tool for qualitative research☆32Updated 9 months ago
- PDF parser powered by grobid☆28Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆276Updated 2 weeks ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆183Updated 2 months ago
- All the OpenAlex API endpoints that are backed by Elasticsearch☆32Updated this week
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 4 months ago
- ☆94Updated last year
- Python API for https://vespa.ai, the open big data serving engine☆133Updated this week
- Get answers to research questions from 200M+ papers. Link to demo -☆205Updated last year
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆229Updated last month
- Awesome deliberative prompting: How to ask LLMs to produce reliable reasoning and make reason-responsive decisions.☆121Updated 6 months ago
- Guideline following Large Language Model for Information Extraction☆391Updated 9 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆261Updated 9 months ago