neuml / paperetlView external linksLinks
π βοΈ ETL processes for medical and scientific papers
β663Dec 7, 2025Updated 2 months ago
Alternatives and similar repositories for paperetl
Users that are interested in paperetl are comparing it to the libraries listed below
Sorting:
- π π€ AI for medical and scientific papersβ1,721Jul 9, 2025Updated 7 months ago
- π Build autonomous agents, retrieval augmented generation (RAG) processes and language model powered chat applicationsβ333May 15, 2025Updated 8 months ago
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,130Updated this week
- A machine learning software for extracting information from scholarly documentsβ4,630Feb 6, 2026Updated last week
- π Semantic search for headlines and story textβ359Sep 23, 2023Updated 2 years ago
- π Build knowledge bases for RAGβ31Jul 3, 2025Updated 7 months ago
- Python client for txtaiβ14Jan 21, 2026Updated 3 weeks ago
- π Automatically annotate papers using LLMsβ403Dec 1, 2025Updated 2 months ago
- π Semantic search for developersβ541Sep 23, 2023Updated 2 years ago
- π Datasets and models for instruction-tuningβ238Sep 23, 2023Updated 2 years ago
- Open Access PDF harvesterβ42May 3, 2024Updated last year
- PDF parser powered by grobidβ28Jul 26, 2024Updated last year
- High accuracy RAG for answering questions from scientific documents with citationsβ8,086Updated this week
- A full spaCy pipeline and models for scientific/biomedical documents.β1,921Dec 4, 2025Updated 2 months ago
- ποΈ Highlight text in documentsβ111Apr 21, 2025Updated 9 months ago
- A Repo focusing on Engineering Physics Applications of MLXβ12Oct 8, 2024Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingesterβ62May 3, 2024Updated last year
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.β692May 26, 2024Updated last year
- Chatbot for The Algorithm ML repo by Twitter.β14Apr 3, 2023Updated 2 years ago
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convertβ¦β24,162Updated this week
- Findpapers: A tool for helping researchers who are looking for related worksβ306Feb 5, 2026Updated last week
- Fetch Academic Research Papers from different sourcesβ469Dec 24, 2025Updated last month
- A proof of concept to scrape papers from journalsβ295Jun 4, 2024Updated last year
- Software that makes labeling PDFs easy.β426May 13, 2024Updated last year
- Download client for legal opinionsβ13Jan 26, 2025Updated last year
- Applying NLP framework to 10-K filings in equity marketsβ14Jul 26, 2021Updated 4 years ago
- β17Nov 7, 2023Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,852Updated this week
- A basic tool that extracts the structure from the PDF files of scientific articles.β76Jan 4, 2022Updated 4 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.β3,105Nov 14, 2024Updated last year
- Efficient Retrieval Augmentation and Generation Frameworkβ1,766Jan 12, 2026Updated last month
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.ioβ148Jun 19, 2025Updated 7 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,275Mar 28, 2025Updated 10 months ago
- A spaCy pipeline and model for NLP on unstructured legal text.β672Jul 16, 2024Updated last year
- A simple library for segmenting legal textsβ17Apr 22, 2023Updated 2 years ago
- A new framework to generate interpretable classification rulesβ18Feb 11, 2023Updated 3 years ago
- Structured Outputsβ13,403Feb 6, 2026Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,852May 17, 2025Updated 8 months ago
- π Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.β438Dec 1, 2025Updated 2 months ago