Red-Hat-AI-Innovation-Team / async-grpoLinks
☆28Updated 2 months ago
Alternatives and similar repositories for async-grpo
Users that are interested in async-grpo are comparing it to the libraries listed below
Sorting:
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆48Updated 6 months ago
- A simple library for scaling up JAX programs☆143Updated 10 months ago
- fast trainer for educational purposes☆16Updated this week
- A Python library for inference-time scaling LLMs☆13Updated 3 weeks ago
- 🧱 Modula software package☆233Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆157Updated 2 months ago
- Learn online intrinsic rewards from LLM feedback☆43Updated 8 months ago
- ☆152Updated last month
- A MAD laboratory to improve AI architecture designs 🧪☆128Updated 8 months ago
- seqax = sequence modeling + JAX☆166Updated last month
- Probabilistic programming with large language models☆134Updated last month
- Minimal yet performant LLM examples in pure JAX☆151Updated last week
- LoRA for arbitrary JAX models and functions☆142Updated last year
- Extract full next-token probabilities via language model APIs☆247Updated last year
- Train very large language models in Jax.☆208Updated last year
- ☆102Updated last month
- Minimal but scalable implementation of large language models in JAX☆35Updated last week
- Inference code for LLaMA models in JAX☆118Updated last year
- Understand and test language model architectures on synthetic tasks.☆224Updated last month
- ☆87Updated last year
- A JAX-native LLM Post-Training Library☆134Updated this week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆133Updated 4 months ago
- ☆52Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated 11 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆188Updated 3 weeks ago
- ☆186Updated last year
- ☆233Updated 6 months ago
- ☆277Updated last year
- Experiment of using Tangent to autodiff triton☆81Updated last year