EleutherAI / pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language modelling friendly format.
☆25 · Feb 16, 2021 · Updated 4 years ago
Alternatives and similar repositories for pile-pubmedcentral
Users who are interested in pile-pubmedcentral are comparing it to the libraries listed below
- Download, parse, and filter data from PubMed, data-ready for The-Pile ☆23 · Dec 16, 2021 · Updated 4 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile. ☆15 · Jun 3, 2023 · Updated 2 years ago
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training ☆47 · Sep 22, 2020 · Updated 5 years ago
- ChatGPT Participates in a Computer Science Exam (2023) ☆31 · Mar 21, 2023 · Updated 2 years ago
- Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora) ☆11 · Mar 17, 2023 · Updated 2 years ago
- Graph-based Image Inpainting ☆15 · Apr 18, 2020 · Updated 5 years ago
- A combination of RoBERTa trained from scratch on masking histone modification patterns rather than the English language and XGBoost, pred… ☆13 · Apr 22, 2021 · Updated 4 years ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare… ☆18 · Mar 26, 2024 · Updated last year
- NHS England PhD Internship Projects Pages ☆19 · Oct 3, 2025 · Updated 4 months ago
- Drift detection module for machine learning pipelines. ☆24 · Jun 21, 2023 · Updated 2 years ago
- Materials and scripts for building cell type encyclopedia table ☆20 · Dec 2, 2025 · Updated 2 months ago
- The World's Most Difficult video game ☆32 · Dec 24, 2025 · Updated last month
- ☆24 · Feb 3, 2019 · Updated 7 years ago
- Code for the PAPA paper ☆27 · Nov 8, 2022 · Updated 3 years ago
- A multi-agent mind implemented using LLMs engaged in ongoing conversation ☆26 · Mar 1, 2023 · Updated 2 years ago
- COVID-19 Related NLP Papers ☆30 · Jan 20, 2022 · Updated 4 years ago
- WeatherFusionNet - our solution to the NeurIPS 2022 Weather4cast competition ☆33 · Nov 30, 2023 · Updated 2 years ago
- Code for IROS 2020 paper: https://arxiv.org/abs/1910.04854 ☆27 · Aug 30, 2024 · Updated last year
- Pytorch code for TM-GCN, a Dynamic Graph Convolutional Networks Using the Tensor M-Product ☆29 · Jun 16, 2021 · Updated 4 years ago
- Talk to your CSV: how to Visualize Your Data with Langchain and Streamlit ☆29 · Aug 26, 2023 · Updated 2 years ago
- Program and links to the material for the GloBIAS Training School 2025, Kobe, Japan. ☆22 · Oct 27, 2025 · Updated 3 months ago
- LLM-powered Q/A over arXiv preprints ☆32 · Apr 5, 2023 · Updated 2 years ago
- ☆36 · Mar 5, 2025 · Updated 11 months ago
- A suite of open-ended, non-imitative tasks involving generalizable skills for large language model chatbots and agents to enable bootstra… ☆43 · Jan 31, 2025 · Updated last year
- Here, I provided the solution for exercises of IBM Quantum Challenge 2020 ☆10 · Oct 27, 2020 · Updated 5 years ago
- Neural Error Mitigation of Near-Term Quantum Simulations (arXiv:2105.08086) ☆10 · Jul 6, 2022 · Updated 3 years ago
- Examples related to Amazon Lightsail ☆12 · Jul 17, 2024 · Updated last year
- [TIP-2017] Official MATLAB implementation of the "ESIM: Edge Similarity for Screen Content Image Quality Assessment" ☆11 · Jul 8, 2025 · Updated 7 months ago
- Harness CI migration utility ☆11 · Dec 17, 2025 · Updated last month
- ☆13 · Updated this week
- Material for the course Theories of Quantum Matter at the University of Cambridge ☆11 · Jan 20, 2023 · Updated 3 years ago
- Some ROS code examples which hopefully are robot-agnostic. ☆12 · May 16, 2018 · Updated 7 years ago
- ☆11 · Sep 17, 2020 · Updated 5 years ago
- Bioinformatics'2023: Consistency Enhancement of Model Prediction on Document-level Named Entity Recognition ☆13 · Jun 8, 2023 · Updated 2 years ago
- Simple tutorial to get familiar with how to program quantum computers using Qiskit ☆11 · Sep 9, 2019 · Updated 6 years ago
- The MolE pre-training framework to learn general molecular representations from unlabeled structures ☆12 · May 26, 2025 · Updated 8 months ago
- ☆10 · Jul 22, 2024 · Updated last year
- Interactive single cell RNA-seq analysis webserver! ☆10 · Mar 15, 2023 · Updated 2 years ago
- A graph based image processing and generation tool. ☆14 · Nov 18, 2025 · Updated 2 months ago