ETL processes for medical and scientific papers
⭐ 678 · Dec 7, 2025 · Updated 4 months ago
Alternatives and similar repositories for paperetl
Users that are interested in paperetl are comparing it to the libraries listed below.
- AI for medical and scientific papers ⭐ 1,755 · Jul 9, 2025 · Updated 9 months ago
- Local chat assistants with AI superpowers ⭐ 336 · Feb 13, 2026 · Updated 2 months ago
- All-in-one AI framework for semantic search, LLM orchestration and language model workflows ⭐ 12,453 · Updated this week
- A machine learning software for extracting information from scholarly documents ⭐ 4,830 · Updated this week
- Semantic search for headlines and story text ⭐ 359 · Sep 23, 2023 · Updated 2 years ago
- Python client for txtai ⭐ 15 · Apr 29, 2026 · Updated last week
- Build knowledge bases for RAG ⭐ 32 · Apr 20, 2026 · Updated 2 weeks ago
- COVID-19 Open Research Dataset (CORD-19) Analysis ⭐ 57 · Nov 20, 2022 · Updated 3 years ago
- PDF parser powered by grobid ⭐ 28 · Jul 26, 2024 · Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester ⭐ 62 · May 3, 2024 · Updated 2 years ago
- Semantic search for developers ⭐ 543 · Sep 23, 2023 · Updated 2 years ago
- Automatically annotate papers using LLMs ⭐ 413 · Dec 1, 2025 · Updated 5 months ago
- LLM Chain querying a scientific Zotero library, with citations ⭐ 441 · Aug 4, 2023 · Updated 2 years ago
- Highlight text in documents ⭐ 113 · Feb 13, 2026 · Updated 2 months ago
- Datasets and models for instruction-tuning ⭐ 238 · Sep 23, 2023 · Updated 2 years ago
- Open Access PDF harvester ⭐ 42 · May 3, 2024 · Updated 2 years ago
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar. ⭐ 47 · Nov 7, 2024 · Updated last year
- High accuracy RAG for answering questions from scientific documents with citations ⭐ 8,436 · Mar 20, 2026 · Updated last month
- Scientific literature explorer. Runs a Pubmed or Semantic Scholar search and allows user to explore high-level structure of result papers ⭐ 51 · Apr 15, 2026 · Updated 3 weeks ago
- A full spaCy pipeline and models for scientific/biomedical documents. ⭐ 1,947 · Dec 4, 2025 · Updated 5 months ago
- My Gen AI research ⭐ 11 · Jun 3, 2024 · Updated last year
- Python PDF parser for scientific publications: content and figures ⭐ 452 · Mar 21, 2024 · Updated 2 years ago
- ⭐ 18 · Nov 7, 2023 · Updated 2 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form. ⭐ 699 · May 26, 2024 · Updated last year
- Findpapers: A tool for helping researchers who are looking for related works ⭐ 348 · Updated this week
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv. ⭐ 515 · Mar 17, 2026 · Updated last month
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ⭐ 52 · Mar 17, 2025 · Updated last year
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations. ⭐ 15 · Jun 9, 2019 · Updated 6 years ago
- h-index-reader is a module that allows you to retrieve author's h-index information from different sources including Google Scholar. ⭐ 14 · Oct 22, 2020 · Updated 5 years ago
- Software that makes labeling PDFs easy. ⭐ 428 · May 13, 2024 · Updated last year
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a… ⭐ 25,059 · Updated this week
- Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ⭐ 449 · Dec 1, 2025 · Updated 5 months ago
- Fetch Academic Research Papers from different sources ⭐ 480 · Dec 24, 2025 · Updated 4 months ago
- Unofficial Python client library for Semantic Scholar APIs. ⭐ 453 · Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ⭐ 1,282 · Mar 28, 2025 · Updated last year
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/ ⭐ 1,045 · Apr 26, 2024 · Updated 2 years ago
- Retrieve and extract citations from Crossref data ⭐ 29 · Mar 11, 2021 · Updated 5 years ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate… ⭐ 44 · Aug 27, 2024 · Updated last year
- ⭐ 758 · May 22, 2023 · Updated 2 years ago