ksm26 / Pretraining-LLMsLinks
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆24Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below
Sorting:
- ☆81Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆191Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆190Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆342Updated 11 months ago
- LLM (Large Language Model) FineTuning☆565Updated 7 months ago
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆208Updated last year
- Notes and commented code for RLHF (PPO)☆116Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated last year
- ☆85Updated this week
- ☆28Updated last year
- An innovative application designed to help pharmacists and pharmacy students quickly research FDA-approved drugs by retrieving relevant i…☆21Updated 8 months ago
- Collection of resources for finetuning Large Language Models (LLMs).☆104Updated 10 months ago
- Apply LLMs to your data, build personal assistants, and expand your use of LLMs with agents, chains, and memories.☆132Updated 2 months ago
- Various installation guides for Large Language Models☆76Updated 6 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- Building LLaMA 4 MoE from Scratch☆68Updated 7 months ago
- Includes examples on how to evaluate LLMs☆23Updated last year
- LLM Evals Leaderboard☆48Updated 2 years ago
- LLM Workshop by Sourab Mangrulkar☆396Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆140Updated 10 months ago
- ☆88Updated 2 years ago
- RAG-VectorDB-Embedings-LlamaIndex-Langchain☆270Updated last month
- ☆17Updated last year
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"☆21Updated 9 months ago
- A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much muc…☆194Updated last month
- Medical RAG QA App using Meditron 7B LLM, Qdrant Vector Database, and PubMedBERT Embedding Model.☆60Updated last year
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆19Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆130Updated last year