thoppe/The-Pile-PubMed


Download, parse, and filter PubMed data, ready for The-Pile

☆23

Alternatives and similar repositories for The-Pile-PubMed

Users who are interested in The-Pile-PubMed are comparing it to the libraries listed below.

zh1yu4nyu / CodeIPPrompt
https://icml.cc/virtual/2023/poster/24354
☆10 • Aug 15, 2023 • Updated 2 years ago
AnWang-AI / AugABSA
This repository contains code for the *SEM 2023 paper “Generative Data Augmentation for Aspect Sentiment Quad Prediction”.
☆10 • May 30, 2023 • Updated 3 years ago
nii-nlp / med-eval
Evaluation pipeline for medical tasks.
☆12 • Apr 8, 2026 • Updated 3 months ago
weichen-yu / LM-Extraction
☆43 • May 23, 2023 • Updated 3 years ago
amazon-science / ContraCLM
[ACL 2023] Code for ContraCLM: Contrastive Learning For Causal Language Model
☆36 • Dec 20, 2023 • Updated 2 years ago
ganeshdg95 / Leveraging-Adversarial-Examples-to-Quantify-Membership-Information-Leakage
☆19 • Mar 6, 2023 • Updated 3 years ago
thu-coai / Targeted-Data-Extraction
Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…
☆24 • May 8, 2023 • Updated 3 years ago
mireshghallah / neighborhood-curvature-mia
☆27 • Aug 18, 2023 • Updated 2 years ago
mrpeerat / CL-ReLKT
The implementation of CL-ReLKT (NAACL-2022)
☆14 • Aug 31, 2022 • Updated 3 years ago
EvryRNA / alphafold3_for_rna
☆15 • Jan 27, 2025 • Updated last year
EvryRNA / rnadvisor
RNAdvisor is a Docker-based wrapper that integrates other metrics and scoring functions for RNA 3D structure evaluation.
☆18 • May 6, 2025 • Updated last year
felix-schmitt / MathNet
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10 • Mar 19, 2025 • Updated last year
hsajjad / ConceptX
Analyzing Latent Concept in Pre-trained Transformer Models
☆12 • Jul 18, 2022 • Updated 4 years ago
Davidham3 / pems_crawler
PeMS crawler
☆15 • Jan 2, 2019 • Updated 7 years ago
pubmedqa / pubmedqa
PubMedQA: A Dataset for Biomedical Research Question Answering
☆434 • Apr 18, 2023 • Updated 3 years ago
sumonbis / FairPreprocessing
This repository contains the artifacts accompanying the paper "Fair Preprocessing"
☆13 • Jul 20, 2021 • Updated 5 years ago
Alexzhuan / awesome-kbqa
🤡 An up-to-date & curated list of awesome KBQA papers, methods & resources.
☆10 • Jul 14, 2022 • Updated 4 years ago
BYU-PCCL / prompt-compression-contrastive-coding
Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"
☆14 • May 31, 2023 • Updated 3 years ago
henryzhao5852 / DELFT
☆12 • Feb 26, 2020 • Updated 6 years ago
allenai / scirepeval
SciRepEval benchmark training and evaluation scripts
☆89 • May 5, 2026 • Updated 2 months ago
weilicao / SPScanner
[COLM '25] Single-Pass Document Scanning for Question Answering
☆14 • Aug 20, 2025 • Updated 11 months ago
naturalconv / NaturalConvDataSet
☆22 • Mar 19, 2021 • Updated 5 years ago
LarsHoldijk / RE-ParameterizedExplainerForGraphNeuralNetworks
☆58 • Mar 22, 2022 • Updated 4 years ago
jtonglet / Numerical-Hybrid-QA-Literature
A list of Numerical Multimodal reasoning papers and their implementation
☆11 • May 13, 2024 • Updated 2 years ago
ottowg / gsap-ner
☆10 • Oct 2, 2024 • Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆109 • Sep 26, 2023 • Updated 2 years ago
sbrisard / pyzottk
Python toolkit to access a Zotero library
☆12 • Feb 9, 2019 • Updated 7 years ago
RUCAIBox / MPOP
☆13 • Jun 16, 2021 • Updated 5 years ago
BGI-HangzhouAI / ATLAS
☆16 • Feb 10, 2026 • Updated 5 months ago
QizhiPei / BioT5
BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)
☆127 • Sep 14, 2024 • Updated last year
dhruvdcoder / wandb-utils
Utility functions for Weights & Biases (wandb).
☆11 • Sep 17, 2024 • Updated last year
AllTheBacteria / Phylign
Map query sequences to the assemblies of all pre-June 2023 bacteria (https://ftp.ebi.ac.uk/pub/databases/AllTheBacteria/Releases/0.2/) on…
☆12 • May 22, 2024 • Updated 2 years ago
skmda37 / CartoonX
CartoonX is a saliency map method for image classifiers operating in the wavelet/shearlet domain.
☆10 • Feb 23, 2026 • Updated 5 months ago
lukoucky / image_recommendation
Image recommendation service that takes an image as input and outputs the most similar images from a database.
☆14 • Sep 19, 2020 • Updated 5 years ago
wenty2015 / Predicting-Clinical-Events-via-Recurrent-Neural-Networks
☆12 • Dec 19, 2016 • Updated 9 years ago
SJTU-CGM / PPanG
A precise pangenome browser combining linear and graph-based pangenomes
☆13 • Jul 16, 2024 • Updated 2 years ago
rcedgar / viratax
Taxonomy classification of viral sequences / contigs
☆12 • Jul 15, 2025 • Updated last year
jiachengxiong / Literature
Recent applications of graph neural networks in drug discovery
☆14 • Mar 19, 2020 • Updated 6 years ago
blinry / sillypond
Generates unplayable music in the style of John Stump's "Faerie’s Aire and Death Waltz".
☆12 • Nov 9, 2019 • Updated 6 years ago