ETL processes for medical and scientific papers
★ 697 | Dec 7, 2025 | Updated 6 months ago
Alternatives and similar repositories for paperetl
Users that are interested in paperetl are comparing it to the libraries listed below.
- AI for medical and scientific papers | ★ 1,760 | Jul 9, 2025 | Updated 11 months ago
- Local chat assistants with AI superpowers | ★ 336 | Feb 13, 2026 | Updated 4 months ago
- All-in-one AI framework for semantic search, LLM orchestration and language model workflows | ★ 12,642 | Jun 8, 2026 | Updated last week
- A machine learning software for extracting information from scholarly documents | ★ 4,940 | Updated this week
- Semantic search for headlines and story text | ★ 359 | Sep 23, 2023 | Updated 2 years ago
- Python client for txtai | ★ 15 | Jun 4, 2026 | Updated last week
- Build knowledge bases for RAG | ★ 32 | Apr 20, 2026 | Updated last month
- COVID-19 Open Research Dataset (CORD-19) Analysis | ★ 57 | Nov 20, 2022 | Updated 3 years ago
- Tokenizer for Text to Speech (TTS) models | ★ 14 | Jan 16, 2025 | Updated last year
- PDF parser powered by grobid | ★ 28 | Jul 26, 2024 | Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester | ★ 62 | May 3, 2024 | Updated 2 years ago
- Semantic search for developers | ★ 542 | Sep 23, 2023 | Updated 2 years ago
- Automatically annotate papers using LLMs | ★ 415 | May 5, 2026 | Updated last month
- LLM Chain querying a scientific Zotero library, with citations | ★ 440 | Aug 4, 2023 | Updated 2 years ago
- Highlight text in documents | ★ 113 | Feb 13, 2026 | Updated 4 months ago
- Datasets and models for instruction-tuning | ★ 238 | Sep 23, 2023 | Updated 2 years ago
- Open Access PDF harvester | ★ 42 | May 3, 2024 | Updated 2 years ago
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar. | ★ 47 | Nov 7, 2024 | Updated last year
- Magnitude fork that only supports Word2Vec, GloVe and fastText embeddings | ★ 13 | Aug 11, 2020 | Updated 5 years ago
- Generate time-lapse video for a website | ★ 21 | Mar 4, 2022 | Updated 4 years ago
- High accuracy RAG for answering questions from scientific documents with citations | ★ 8,703 | Updated this week
- Scientific literature explorer. Runs a Pubmed or Semantic Scholar search and allows user to explore high-level structure of result papers | ★ 51 | Apr 15, 2026 | Updated 2 months ago
- A full spaCy pipeline and models for scientific/biomedical documents. | ★ 1,964 | Dec 4, 2025 | Updated 6 months ago
- ★ 18 | Nov 7, 2023 | Updated 2 years ago
- Python PDF parser for scientific publications: content and figures | ★ 454 | Mar 21, 2024 | Updated 2 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form. | ★ 700 | May 26, 2024 | Updated 2 years ago
- Findpapers: A tool for helping researchers who are looking for related works | ★ 363 | May 28, 2026 | Updated 2 weeks ago
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv. | ★ 528 | Updated this week
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… | ★ 53 | Mar 17, 2025 | Updated last year
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations. | ★ 15 | Jun 9, 2019 | Updated 7 years ago
- Pipeline for analyzing rare mutations in metagenome-assembled genomes | ★ 10 | Apr 4, 2025 | Updated last year
- h-index-reader is a module that allows you to retrieve author's h-index information from different sources including Google Scholar. | ★ 14 | Oct 22, 2020 | Updated 5 years ago
- Software that makes labeling PDFs easy. | ★ 429 | May 13, 2024 | Updated 2 years ago
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a… | ★ 25,571 | Updated this week
- Dehyphenation of broken text (mainly German), i.e., extracted from a PDF | ★ 39 | Mar 8, 2022 | Updated 4 years ago
- Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. | ★ 454 | Jun 5, 2026 | Updated last week
- Fetch Academic Research Papers from different sources | ★ 485 | Dec 24, 2025 | Updated 5 months ago
- Unofficial Python client library for Semantic Scholar APIs. | ★ 462 | May 27, 2026 | Updated 2 weeks ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. | ★ 1,283 | Mar 28, 2025 | Updated last year