kabir2505 / tiny-mixtralLinks

☆43

Alternatives and similar repositories for tiny-mixtral

Users that are interested in tiny-mixtral are comparing it to the libraries listed below

Sorting:

kmohan321 / Research_Papers
☆46Updated 4 months ago
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆94Updated 4 months ago
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆164Updated 4 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 3 months ago
joey00072 / Multi-Head-Latent-Attention-MLA-
working implimention of deepseek MLA
☆42Updated 6 months ago
evintunador / triton_docs_tutorials
making the official triton tutorials actually comprehensible
☆53Updated 2 weeks ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated 2 months ago
hkproj / triton-flash-attention
☆184Updated 7 months ago
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆34Updated 2 months ago
cloneofsimo / ptx-tutorial-by-aislop
PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)
☆66Updated 4 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
usamec / lowmem_finetuning
Low memory full parameter finetuning of LLMs
☆52Updated 2 weeks ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆66Updated 2 weeks ago
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28Updated last month
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 5 months ago
1y33 / 100Days
GPU Kernels
☆191Updated 3 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 11 months ago
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆64Updated 2 months ago
OpenMachine-ai / transformer-tricks
A collection of tricks and tools to speed up transformer models
☆169Updated 2 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆82Updated 2 months ago
hkproj / multi-latent-attention
☆43Updated 2 months ago
joey00072 / nanoGRPO
nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)
☆113Updated 2 months ago
tilde-research / MoMoE-impl
Memory optimized Mixture of Experts
☆51Updated last week
JINO-ROHIT / advanced_ml
☆59Updated last week
huggingface / picotron_tutorial
☆206Updated 5 months ago
FareedKhan-dev / gpt4o-from-scratch
Implementation of a GPT-4o like Multimodal from Scratch using Python
☆69Updated 4 months ago
FareedKhan-dev / train-llama4
Building LLaMA 4 MoE from Scratch
☆60Updated 3 months ago
mingyin0312 / RL4LLM
RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct
☆29Updated 5 months ago
goyalpramod / Foundational-ML-papers
Implementations of Papers that I read, you can read my breakdown in my blog
☆78Updated 2 weeks ago