Open-Superintelligence-Lab / blueberry-llmLinks
☆63Updated this week
Alternatives and similar repositories for blueberry-llm
Users that are interested in blueberry-llm are comparing it to the libraries listed below
Sorting:
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆79Updated 3 months ago
- ☆26Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆16Updated 8 months ago
- One click templates for inferencing Language Models☆222Updated 3 weeks ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- ☆63Updated 5 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆79Updated 3 months ago
- ☆159Updated 8 months ago
- The State Of The Art, intelligence☆157Updated 4 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆106Updated 2 months ago
- ☆101Updated last week
- ☆68Updated 6 months ago
- ☆86Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆481Updated 3 months ago
- Simple examples using Argilla tools to build AI☆56Updated last year
- ☆46Updated 8 months ago
- Marketplace ML experiment - training without backprop☆27Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- rl from zero pretrain, can it be done? yes.☆282Updated 2 months ago
- Distributed Inference for mlx LLm☆99Updated last year
- Train transformer language models with reinforcement learning.☆19Updated 9 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆167Updated 3 months ago
- ☆15Updated last week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 11 months ago
- ☆103Updated 2 months ago
- ☆136Updated 8 months ago