Andrei-Aksionov / nanoGPTplusLinks
☆53Updated last year
Alternatives and similar repositories for nanoGPTplus
Users that are interested in nanoGPTplus are comparing it to the libraries listed below
Sorting:
- Training and Fine-tuning an llm in Python and PyTorch.☆43Updated 2 years ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆113Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆233Updated last year
- experiments with inference on llama☆103Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆268Updated last month
- Minimal code to train a Large Language Model (LLM).☆172Updated 3 years ago
- Inference Llama 2 in one file of pure Python☆425Updated last month
- ☆94Updated 2 years ago
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- nanogpt turned into a chat model☆80Updated 2 years ago
- batched loras☆347Updated 2 years ago
- ☆198Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆259Updated 2 years ago
- ☆416Updated 2 years ago
- Fast bare-bones BPE for modern tokenizer training☆174Updated 6 months ago
- Experiments on speculative sampling with Llama models☆127Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆203Updated last year
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆59Updated 2 years ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"☆392Updated last year
- Google TPU optimizations for transformers models☆133Updated 3 weeks ago
- Fine-tuning LLMs using QLoRA☆266Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆276Updated this week
- 1.58-bit LLaMa model☆83Updated last year
- LLM Workshop by Sourab Mangrulkar☆400Updated last year
- inference code for mixtral-8x7b-32kseqlen☆105Updated 2 years ago
- ☆86Updated last year