ksm26 / Pretraining-LLMs

Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.

☆21

Alternatives and similar repositories for Pretraining-LLMs

Users who are interested in Pretraining-LLMs compare it to the repositories listed below.

ChanCheeKean / DataScience
☆84 Updated last year
alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…
☆77 Updated last year
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
☆172 Updated 11 months ago
Andy-jqa / biomedical-qa-datasets
Biomedical Question Answering Datasets.
☆112 Updated 2 months ago
hkproj / rlhf-ppo
Notes and commented code for RLHF (PPO)
☆99 Updated last year
rbiswasfc / llm-detect-ai
1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition
☆198 Updated last year
dptech-corp / Uni-SMART
☆46 Updated 9 months ago
Praveen76 / LLMs-Interview-Prep-Guide
Welcome to the LLMs Interview Prep Guide! This GitHub repository offers a curated set of interview questions and answers tailored for Dat…
☆145 Updated last year
openlifescience-ai / Open-Medical-Reasoning-Tasks
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
☆125 Updated 10 months ago
hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
☆74 Updated last year
som-shahlab / Clinfo.AI
This is Clinfo.AI Demo Instruction
☆34 Updated 11 months ago
FudanDNN-NLP / RAG
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)
☆327 Updated 7 months ago
FareedKhan-dev / create-million-parameter-llm-from-scratch
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆180 Updated last year
dmis-lab / self-biorag
[ISMB '24] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models
☆63 Updated last year
aymeric-roucher / benchmark_agents
☆27 Updated last year
rbiswasfc / llm-science-exam
6th Position Solution Code for Kaggle - LLM Science Exam Competition
☆23 Updated last year
tcapelle / llm_recipes
A set of scripts and notebooks on LLM fine-tuning and dataset creation
☆110 Updated 9 months ago
medmcqa / medmcqa
A large-scale (194k) Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions.
☆222 Updated 2 years ago
lamm-mit / LLM-finetuning
☆27 Updated 10 months ago
abhinand5 / MedEmbed
MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.
☆70 Updated 9 months ago
youssefHosni / Weekly-Top-LLM-Papers
Curated list of weekly published LLM papers
☆180 Updated 2 weeks ago
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆387 Updated last year
ncbi-nlp / cell-o1
Code and data for Cell-o1.
☆19 Updated last week
docugami / KG-RAG-datasets
Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets
☆166 Updated last year
ashishjamarkattel / reinforment-learning-with-human-feedback
☆15 Updated last year
neubig / minllama-assignment
☆90 Updated 10 months ago
rohan-paul / LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
☆546 Updated 3 months ago
Shekswess / LLM-Medical-Finetuning
A code repository that contains all the code for fine-tuning some of the popular LLMs on medical data
☆57 Updated last year
OSU-NLP-Group / LLM4Chem
Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality …
☆94 Updated last month
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77 Updated 9 months ago