saurabhaloneai / qwen3-exp
qwen3 experiments
☆33 Updated 6 months ago
Alternatives and similar repositories for qwen3-exp
Users that are interested in qwen3-exp are comparing it to the libraries listed below
- ☆78 Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratch ☆82 Updated 4 months ago
- Lego for GRPO ☆30 Updated 7 months ago
- ☆15 Updated 3 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- ☆90 Updated 11 months ago
- ☆68 Updated 7 months ago
- Useful resources for LLM-based Diarization and Transcription. ☆55 Updated last year
- Repository to create traveling waves that integrate spatial information through time ☆56 Updated 5 months ago
- Marketplace ML experiment - training without backprop ☆27 Updated 4 months ago
- ☆62 Updated 5 months ago
- rl from zero pretrain, can it be done? yes. ☆286 Updated 3 months ago
- ☆37 Updated 5 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆76 Updated 7 months ago
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research. ☆92 Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research ☆306 Updated last month
- Solving data for LLMs - Create quality synthetic datasets! ☆151 Updated 11 months ago
- Exploring Agno framework for building AI agents. ☆25 Updated 10 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆109 Updated 10 months ago
- ☆46 Updated 9 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in… ☆27 Updated 7 months ago
- ☆57 Updated 10 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 2 months ago
- ☆30 Updated last year
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆223 Updated 2 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- This codebase demonstrates various DSPy functionalities through practical examples. ☆56 Updated 10 months ago
- An MCP server that uses the Osmosis-Apply-1.7B model to apply code merges ☆53 Updated 6 months ago
- RL gym for vision language models written in JAX ☆139 Updated 2 months ago