Shekswess / tiny-reasoning-language-modelLinks
Code repository dedicated to experimenting and research with tiny reasoning language model
☆43Updated last month
Alternatives and similar repositories for tiny-reasoning-language-model
Users that are interested in tiny-reasoning-language-model are comparing it to the libraries listed below
Sorting:
- Exploring Applications of GRPO☆251Updated 4 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆328Updated 2 months ago
- minimal GRPO implementation from scratch☆102Updated 9 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆140Updated 8 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆575Updated 3 months ago
- ☆465Updated 4 months ago
- rl from zero pretrain, can it be done? yes.☆286Updated 3 months ago
- Low memory full parameter finetuning of LLMs☆53Updated 5 months ago
- ☆46Updated 9 months ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆142Updated last year
- ☆537Updated 5 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆79Updated 7 months ago
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- A practical guide to diffusion models, implemented from scratch.☆232Updated last week
- An extension of the nanoGPT repository for training small MOE models.☆224Updated 10 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 8 months ago
- Learn the building blocks of how to build DeepSeek from scratch.☆89Updated 3 months ago
- ☆224Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- Build your own visual reasoning model☆416Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 7 months ago
- Distributed training (multi-node) of a Transformer model☆90Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆340Updated 9 months ago
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆31Updated 10 months ago
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago
- ☆45Updated 8 months ago
- ☆69Updated 5 months ago
- GPU Kernels☆217Updated 8 months ago
- Simple repository for training small reasoning models☆47Updated 11 months ago
- Best practices & guides on how to write distributed pytorch training code☆562Updated 2 months ago