thoppe / The-Pile-PubMed
Download, parse, and filter data from PubMed, data-ready for The-Pile
☆23 Updated 4 years ago
Alternatives and similar repositories for The-Pile-PubMed
Users who are interested in The-Pile-PubMed are comparing it to the repositories listed below
- A script for collecting the PubMed Central dataset in a language modelling friendly format. ☆25 Updated 4 years ago
- Pre-trained Language Model for Scientific Text ☆46 Updated last year
- ☆28 Updated 9 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆47 Updated 2 years ago
- Few-shot Learning with Auxiliary Data ☆31 Updated 2 years ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆38 Updated 2 years ago
- Pretraining Efficiently on S2ORC! ☆175 Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆46 Updated 9 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024) ☆13 Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆33 Updated last year
- ☆130 Updated last year
- Instructions and demonstrations for building a GLM capable of formal logical reasoning ☆55 Updated last year
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆23 Updated 8 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆85 Updated last year
- Adding new tasks to T0 without catastrophic forgetting ☆33 Updated 3 years ago
- Official implementation of the ACL 2024 paper "Scientific Inspiration Machines Optimized for Novelty" ☆90 Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆99 Updated 4 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆60 Updated 2 years ago
- Retrieval as Attention ☆82 Updated 3 years ago
- Embedding Recycling for Language models ☆38 Updated 2 years ago
- ☆75 Updated last year
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention) ☆62 Updated 3 years ago
- ☆56 Updated 2 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆82 Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆48 Updated 11 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆98 Updated 2 years ago
- Official Code Repository for the paper "KALA: Knowledge-Augmented Language Model Adaptation" (NAACL 2022) ☆35 Updated 2 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ☆78 Updated 2 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22) ☆25 Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 Updated 2 years ago