thoppe / The-Pile-PubMedLinks
Download, parse, and filter data PubMed, data-ready for The-Pile
☆23Updated 3 years ago
Alternatives and similar repositories for The-Pile-PubMed
Users that are interested in The-Pile-PubMed are comparing it to the libraries listed below
Sorting:
- Few-shot Learning with Auxiliary Data☆28Updated last year
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆24Updated 4 years ago
- Retrieval as Attention☆82Updated 2 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆25Updated 2 years ago
- Supporting code for ReCEval paper☆28Updated 9 months ago
- Pre-trained Language Model for Scientific Text☆45Updated last year
- Interpretable unified language safety checking with large language models☆30Updated 2 years ago
- ☆28Updated 4 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆46Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 9 months ago
- ☆22Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- ☆14Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- ☆72Updated last year
- ☆35Updated last year
- Uncertainty-Aware Reliable Text Classification (KDD 2021)☆18Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆40Updated 3 months ago
- Offiical codes for DNA-GPT (ICLR 2024)☆50Updated last year
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆14Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago
- ☆39Updated 2 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆23Updated 2 months ago
- Tasks for describing differences between text distributions.☆16Updated 10 months ago
- End-to-end training of Retrieval-Augmented LMs (REALM, RAG)☆22Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆74Updated last year
- This repository contains ScholarQABench data and evaluation pipeline.☆72Updated 2 months ago