thoppe / The-Pile-PubMed
Download, parse, and filter data PubMed, data-ready for The-Pile
☆22Updated 3 years ago
Alternatives and similar repositories for The-Pile-PubMed:
Users that are interested in The-Pile-PubMed are comparing it to the libraries listed below
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆26Updated 2 years ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆24Updated 4 years ago
- ☆24Updated 3 months ago
- Tasks for describing differences between text distributions.☆16Updated 6 months ago
- ☆33Updated 10 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated this week
- Supporting code for ReCEval paper☆28Updated 5 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆38Updated last year
- ☆17Updated 7 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆48Updated last year
- Retrieval as Attention☆83Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆32Updated 2 months ago
- ☆23Updated 3 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆50Updated 9 months ago
- ☆13Updated last year
- ☆49Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆26Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆21Updated 2 months ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆89Updated 3 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated 9 months ago
- ☆39Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆29Updated last year
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆25Updated 5 months ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆32Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- Codebase for Instruction Following without Instruction Tuning☆33Updated 4 months ago
- Adding new tasks to T0 without catastrophic forgetting☆32Updated 2 years ago
- ☆111Updated 7 months ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆74Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆22Updated last year