allenai / pdf-component-libraryLinks
☆88Updated last year
Alternatives and similar repositories for pdf-component-library
Users that are interested in pdf-component-library are comparing it to the libraries listed below
Sorting:
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar.☆46Updated last year
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.☆258Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆220Updated last year
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆107Updated 9 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆453Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62Updated last year
- PDF parser powered by grobid☆28Updated last year
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.☆165Updated 9 months ago
- Unofficial Python client library for Semantic Scholar APIs.☆430Updated last month
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆249Updated last week
- library supporting NLP and CV research on scientific papers☆788Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆53Updated 10 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆221Updated 5 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆198Updated 8 months ago
- ☆116Updated 3 months ago
- Extract structured text from pdfs quickly☆656Updated 7 months ago
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…☆147Updated 7 months ago
- Python PDF parser for scientific publications: content and figures☆448Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆446Updated last year
- ☆106Updated 3 months ago
- Social and customizable AI writing assistant! ✍️☆259Updated 7 months ago
- A spaCy wrapper for GliNER☆129Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆296Updated last week
- All the OpenAlex API endpoints that are backed by Elasticsearch☆38Updated this week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated 2 years ago
- multimodal document analysis☆166Updated 2 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆317Updated last year
- ☆47Updated 5 months ago
- 🤖🌊 aiFlows: The building blocks of your collaborative AI☆272Updated last year
- SciRepEval benchmark training and evaluation scripts☆80Updated this week