ksm26 / Pretraining-LLMsLinks
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆21Updated 10 months ago
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below
Sorting:
- ☆83Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆77Updated last year
- Research projects built on top of Transformers☆58Updated 3 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆18Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆175Updated last year
- Notes and commented code for RLHF (PPO)☆96Updated last year
- Welcome to the LLMs Interview Prep Guide! This GitHub repository offers a curated set of interview questions and answers tailored for Dat…☆137Updated last year
- Various installation guides for Large Language Models☆70Updated 2 months ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated 9 months ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆218Updated 2 years ago
- [ISMB '24] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆63Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 9 months ago
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 , A…☆69Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆45Updated 9 months ago
- LLM (Large Language Model) FineTuning☆542Updated 2 months ago
- ☆24Updated 5 months ago
- ☆37Updated 3 weeks ago
- Quick tutorial showing how to fine-tune Llama3.1 with nothing but free tools and text data. All code included in ipynb. For a step by ste…☆9Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 8 months ago
- Medical RAG QA App using Meditron 7B LLM, Qdrant Vector Database, and PubMedBERT Embedding Model.☆54Updated last year
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆160Updated last year
- ☆20Updated 3 years ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated last year
- ☆27Updated last year
- ☆26Updated 9 months ago
- Build an LLM powered Ask the Data App with LangChain (using the Pandas DataFrame Agent) and Streamlit☆28Updated last year
- minimal GRPO implementation from scratch☆90Updated 3 months ago
- Data preparation code for Amber 7B LLM☆91Updated last year
- ☆54Updated 4 months ago
- This is Clinfo.AI Demo Instruction☆34Updated 10 months ago