tensorwavecloud / ScalarLM
ScalarLM - a unified training and inference stack
☆36 Updated 2 weeks ago
Alternatives and similar repositories for ScalarLM
Users who are interested in ScalarLM are comparing it to the libraries listed below.
- Cray-LM unified training and inference stack. ☆22 Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆60 Updated last week
- SIMD quantization kernels ☆70 Updated this week
- GPU documentation for humans ☆66 Updated 3 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 4 months ago
- Verbosity control for AI agents ☆63 Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆47 Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆44 Updated this week
- High-Performance SGEMM on CUDA devices ☆94 Updated 4 months ago
- ☆59 Updated 2 weeks ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆74 Updated last week
- High-Performance Engine for Multi-Vector Search ☆80 Updated this week
- ☆24 Updated this week
- Lego for GRPO ☆28 Updated last week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆88 Updated 2 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆76 Updated last month
- ☆29 Updated 6 months ago
- Ongoing research training transformer models at scale ☆37 Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers ☆66 Updated last month
- ☆89 Updated 8 months ago
- LLM training in simple, raw C/CUDA ☆99 Updated last year
- Modded vLLM to run pipeline parallelism over public networks ☆35 Updated 2 weeks ago
- ☆15 Updated 2 months ago
- look how they massacred my boy ☆63 Updated 7 months ago
- Train, tune, and infer Bamba model ☆127 Updated this week
- Simple repository for training small reasoning models ☆31 Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆64 Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆100 Updated 3 months ago
- ☆19 Updated 10 months ago
- ☆99 Updated last week