tensorwavecloud / ScalarLMLinks

ScalarLM - a unified training and inference stack

☆40

Alternatives and similar repositories for ScalarLM

Users that are interested in ScalarLM are comparing it to the libraries listed below

Sorting:

dorjeduck / momograd
A Learning Journey: Micrograd in Mojo 🔥
☆61Updated 8 months ago
deepsilicon / Sila
☆89Updated 8 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆70Updated 4 months ago
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆72Updated this week
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆95Updated last month
cray-lm / cray-lm
Cray-LM unified training and inference stack.
☆22Updated 4 months ago
mobiusml / aana_sdk
Aana SDK is a powerful framework for building AI enabled multimodal applications.
☆47Updated last week
gpu-mode / discord-cluster-manager
Write a fast kernel and run it on Discord. See how you compare against the best!
☆46Updated this week
bipul1010 / agents_tutorial
☆19Updated 10 months ago
HazyResearch / train-tk
train with kittens!
☆60Updated 8 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆53Updated 4 months ago
xjdr-alt / muzero_sketch
☆38Updated 11 months ago
charlesfrye / minimodal
A miniature version of Modal
☆20Updated last year
Amplify-Partners / annotation-reading-list
A reading list of relevant papers and projects on foundation model annotation
☆27Updated 4 months ago
charlesfrye / cuda-substrings
Because it's there.
☆16Updated 9 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 8 months ago
modal-labs / gpu-glossary
GPU documentation for humans
☆70Updated last week
BBischof / yapping
Verbosity control for AI agents
☆63Updated last year
jxmorris12 / embzip
lossily compress representation vectors using product quantization
☆57Updated 2 months ago
basetenlabs / Workshop-TRT-LLM
☆19Updated last year
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆71Updated this week
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆30Updated 9 months ago
brendanhogan / picoDeepResearch
☆63Updated last month
lightonai / fast-plaid
High-Performance Engine for Multi-Vector Search
☆106Updated 3 weeks ago
cloneofsimo / ptx-tutorial-by-aislop
PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)
☆66Updated 3 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆78Updated 6 months ago
jlscheerer / xtr-warp
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆131Updated last month
AnswerDotAI / GeminiSave
☆50Updated 2 months ago
AI-Hypercomputer / RecML
☆183Updated this week
PrimeIntellect-ai / prime-vllm
Modded vLLM to run pipeline parallelism over public networks
☆37Updated last month