ksm26 / Pretraining-LLMsLinks
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆24Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below
Sorting:
- ☆83Updated last year
- Collection of resources for finetuning Large Language Models (LLMs).☆101Updated 9 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆186Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆336Updated 9 months ago
- A code repository that cointains all the code for finetuning some of the popular LLMs on medical data☆63Updated last year
- ☆54Updated last month
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆186Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 11 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆228Updated last week
- ☆27Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆128Updated last year
- minimal GRPO implementation from scratch☆98Updated 6 months ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.☆178Updated 5 months ago
- LLM (Large Language Model) FineTuning☆563Updated 6 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 2 months ago
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆203Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆129Updated last year
- ☆23Updated 9 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated last year
- a curated list of the role of small models in the LLM era☆105Updated last year
- ☆27Updated last year
- ☆43Updated last year
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆308Updated 11 months ago
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆148Updated 11 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Multimodal RAG using Langchain☆55Updated last year
- [ISMB 2024] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆64Updated last year
- ☆87Updated last year