ksm26 / Pretraining-LLMsLinks
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆22Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below
Sorting:
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆77Updated last year
- ☆84Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆179Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆333Updated 8 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆182Updated last year
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆202Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 11 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆128Updated 11 months ago
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆52Updated last year
- ☆22Updated last year
- ☆27Updated 11 months ago
- LLM (Large Language Model) FineTuning☆560Updated 5 months ago
- Distributed training (multi-node) of a Transformer model☆80Updated last year
- Apply LLMs to your data, build personal assistants, and expand your use of LLMs with agents, chains, and memories.☆127Updated last week
- ☆94Updated 5 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆129Updated last year
- Multimodal RAG using Langchain☆54Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 10 months ago
- ☆145Updated last year
- Notes and commented code for RLHF (PPO)☆104Updated last year
- Simple introduction to LLM Agents☆139Updated last year
- ☆54Updated last week
- ☆86Updated last year
- Integrating knowledge graphs (KG) with large language models (LLM)☆171Updated 8 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆226Updated 2 months ago
- cheap & easy LLM experiments for amateurs (alpha)☆21Updated 2 weeks ago
- ☆46Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆322Updated 4 months ago
- Building LLaMA 4 MoE from Scratch☆62Updated 4 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆125Updated 2 years ago