hkproj / mistral-src-commented
Reference implementation of Mistral AI 7B v0.1 model.
☆28Updated last year
Alternatives and similar repositories for mistral-src-commented:
Users that are interested in mistral-src-commented are comparing it to the libraries listed below
- a simplified version of Google's Gemma model to be used for learning☆24Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 10 months ago
- Notes on the Mistral AI model☆18Updated last year
- ML algorithms implementations that are good for learning the underlying principles☆20Updated 3 months ago
- ☆136Updated 2 months ago
- One click templates for inferencing Language Models☆165Updated this week
- Training and Fine-tuning an llm in Python and PyTorch.☆41Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆105Updated 5 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆99Updated last year
- LLaMA 2 implemented from scratch in PyTorch☆307Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆196Updated 8 months ago
- ☆52Updated last month
- End-to-End LLM Guide☆104Updated 8 months ago
- Notes on quantization in neural networks☆77Updated last year
- Video+code lecture on building nanoGPT from scratch☆66Updated 9 months ago
- Various installation guides for Large Language Models☆64Updated 4 months ago
- Distributed training (multi-node) of a Transformer model☆59Updated 11 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Set of scripts to finetune LLMs☆37Updated 11 months ago
- Collection of autoregressive model implementation☆83Updated last month
- ☆32Updated last month
- From scratch implementation of a vision language model in pure PyTorch☆205Updated 10 months ago
- ☆87Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 4 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆277Updated last month