ksm26 / Pretraining-LLMsLinks
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆24Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below
Sorting:
- ☆82Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- ☆30Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- ☆92Updated last week
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Updated last year
- ☆27Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆116Updated last year
- LLM (Large Language Model) FineTuning☆565Updated 10 months ago
- ☆55Updated 5 months ago
- Multimodal RAG using Langchain☆58Updated 2 years ago
- Collection of resources for finetuning Large Language Models (LLMs).☆111Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆132Updated last year
- Curated list of weekly published LLM papers☆200Updated 3 weeks ago
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"☆23Updated 11 months ago
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 , A…☆104Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Learn the building blocks of how to build gpt-oss from scratch☆113Updated 4 months ago
- ☆147Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆239Updated 4 months ago
- Various installation guides for Large Language Models☆77Updated 9 months ago
- Collection of resources for RL and Reasoning☆27Updated last year
- LLM Workshop by Sourab Mangrulkar☆401Updated last year
- Distributed training (multi-node) of a Transformer model☆93Updated last year
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆211Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆129Updated 2 years ago
- Medical RAG QA App using Meditron 7B LLM, Qdrant Vector Database, and PubMedBERT Embedding Model.☆62Updated 2 years ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆311Updated last year
- Notes and commented code for RLHF (PPO)☆124Updated last year