cindysridykhan / instruct_storyteller_tinyllama2Links
Training and Fine-tuning an llm in Python and PyTorch.
☆43Updated 2 years ago
Alternatives and similar repositories for instruct_storyteller_tinyllama2
Users that are interested in instruct_storyteller_tinyllama2 are comparing it to the libraries listed below
Sorting:
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆163Updated 4 months ago
- ☆86Updated last year
- Pre-training code for Amber 7B LLM☆170Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆50Updated last month
- Data preparation code for Amber 7B LLM☆94Updated last year
- A bagel, with everything.☆326Updated last year
- ☆95Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- ☆78Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- A pipeline for LLM knowledge distillation☆111Updated 8 months ago
- ☆74Updated 2 years ago
- ☆85Updated 2 years ago
- experiments with inference on llama☆103Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆245Updated last year
- Experiments on speculative sampling with Llama models☆127Updated 2 years ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆161Updated 2 years ago
- nanogpt turned into a chat model☆79Updated 2 years ago
- ☆94Updated 2 years ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆137Updated 2 years ago
- A compact LLM pretrained in 9 days by using high quality data☆337Updated 8 months ago
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆315Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆141Updated 2 years ago
- My fork os allen AI's OLMo for educational purposes.☆30Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..☆64Updated last year