VizuaraAI / truly-open-gpt-ossLinks
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆79Updated 3 months ago
Alternatives and similar repositories for truly-open-gpt-oss
Users that are interested in truly-open-gpt-oss are comparing it to the libraries listed below
Sorting:
- Learn the building blocks of how to build gpt-oss from scratch☆106Updated 2 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆40Updated 2 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- ☆122Updated 6 months ago
- ☆159Updated 8 months ago
- ☆62Updated 5 months ago
- ☆109Updated 6 months ago
- Train LLM on Hugging Face infra☆67Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆167Updated 3 months ago
- RLVR Testing and Training☆23Updated 3 months ago
- Sparse Inferencing for transformer based LLMs☆215Updated 4 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆123Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- ☆46Updated 8 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆37Updated 2 weeks ago
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 7 months ago
- ☆63Updated this week
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 7 months ago
- ☆68Updated 6 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆305Updated last week
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆107Updated last month
- ☆101Updated 6 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆57Updated 3 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Updated last year
- ☆63Updated 5 months ago
- All information and news with respect to Falcon-H1 series☆93Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆103Updated 7 months ago