ksm26 / Pretraining-LLMsLinks
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆21Updated 11 months ago
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below
Sorting:
- ☆84Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆77Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆172Updated 11 months ago
- Biomedical Question Answering Datasets.☆112Updated 2 months ago
- Notes and commented code for RLHF (PPO)☆99Updated last year
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆198Updated last year
- ☆46Updated 9 months ago
- Welcome to the LLMs Interview Prep Guide! This GitHub repository offers a curated set of interview questions and answers tailored for Dat…☆145Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆125Updated 10 months ago
- Distributed training (multi-node) of a Transformer model☆74Updated last year
- This is Clinfo.AI Demo Instruction☆34Updated 11 months ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆327Updated 7 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆180Updated last year
- [ISMB '24] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆63Updated last year
- ☆27Updated last year
- 6th Position Solution Code for Kaggle - LLM Science Exam Competition☆23Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 9 months ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆222Updated 2 years ago
- ☆27Updated 10 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆70Updated 9 months ago
- Curated list of weekly published LLM papers☆180Updated 2 weeks ago
- LLM Workshop by Sourab Mangrulkar☆387Updated last year
- Code and data for Cell-o1.☆19Updated last week
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆166Updated last year
- ☆15Updated last year
- ☆90Updated 10 months ago
- LLM (Large Language Model) FineTuning☆546Updated 3 months ago
- A code repository that cointains all the code for finetuning some of the popular LLMs on medical data☆57Updated last year
- Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality …☆94Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 9 months ago