hkproj / mistral-src-commentedLinks
Reference implementation of Mistral AI 7B v0.1 model.
☆29Updated last year
Alternatives and similar repositories for mistral-src-commented
Users that are interested in mistral-src-commented are comparing it to the libraries listed below
Sorting:
- ML algorithms implementations that are good for learning the underlying principles☆22Updated 5 months ago
- ☆35Updated last week
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆104Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆111Updated last week
- Notes on the Mistral AI model☆19Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated 8 months ago
- Various installation guides for Large Language Models☆68Updated last month
- One click templates for inferencing Language Models☆185Updated 3 weeks ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated last year
- making the official triton tutorials actually comprehensible☆34Updated 2 months ago
- Notes on quantization in neural networks☆83Updated last year
- ☆168Updated 5 months ago
- a simplified version of Google's Gemma model to be used for learning☆25Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆32Updated 2 weeks ago
- coding CUDA everyday!☆31Updated last month
- Set of scripts to finetune LLMs☆37Updated last year
- rl from zero pretrain, can it be done? we'll see.☆24Updated this week
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆122Updated last year
- ☆54Updated 3 months ago
- Notes about LLaMA 2 model☆61Updated last year
- Fine tune Gemma 3 on an object detection task☆43Updated this week
- From scratch implementation of a vision language model in pure PyTorch☆220Updated last year
- Distributed training (multi-node) of a Transformer model☆68Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- ☆46Updated 2 months ago
- LLaMA 2 implemented from scratch in PyTorch☆329Updated last year
- Google TPU optimizations for transformers models☆112Updated 4 months ago
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆29Updated 3 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆196Updated last month
- ☆39Updated last month