ksm26 / Pretraining-LLMs
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆24Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users who are interested in Pretraining-LLMs are comparing it to the libraries listed below.
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆196Updated last year
- ☆82Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆196Updated last year
- ☆29Updated last year
- ☆89Updated last week
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"☆22Updated 10 months ago
- RAG-VectorDB-Embedings-LlamaIndex-Langchain☆277Updated 2 months ago
- LLM (Large Language Model) FineTuning☆566Updated 9 months ago
- ☆26Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- A set of scripts and notebooks on LLM finetuning and dataset creation☆113Updated last year
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆210Updated last year
- Learn the building blocks of how to build gpt-oss from scratch☆110Updated 3 months ago
- Collection of resources for finetuning Large Language Models (LLMs).☆109Updated last year
- Various installation guides for Large Language Models☆77Updated 8 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆132Updated last year
- ☆44Updated last year
- Apply LLMs to your data, build personal assistants, and expand your use of LLMs with agents, chains, and memories.☆138Updated 4 months ago
- ☆104Updated 9 months ago
- Multimodal RAG using Langchain☆57Updated last year
- Collection of resources for RL and Reasoning☆27Updated 11 months ago
- ☆55Updated 4 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆141Updated 11 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆236Updated 3 months ago
- Fine Tune DeepSeek☆44Updated 11 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆84Updated last month
- ☆91Updated 8 months ago
- Integrating knowledge graphs (KG) with large language models (LLM)☆184Updated 2 months ago