tensorwavecloud / ScalarLM
ScalarLM - a unified training and inference stack
☆44 · Updated this week
Alternatives and similar repositories for ScalarLM
Users who are interested in ScalarLM are comparing it to the libraries listed below.
- ☆188 · Updated 3 weeks ago
- Cray-LM unified training and inference stack. ☆22 · Updated 5 months ago
- ☆89 · Updated 9 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆73 · Updated this week
- Foyle is a copilot to help developers deploy and operate their applications. ☆131 · Updated 4 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆49 · Updated this week
- SIMD quantization kernels ☆73 · Updated this week
- ☆186 · Updated this week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆96 · Updated this week
- ☆214 · Updated 5 months ago
- look how they massacred my boy ☆63 · Updated 9 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW. ☆144 · Updated last year
- ☆38 · Updated 11 months ago
- lossily compress representation vectors using product quantization ☆57 · Updated 2 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆46 · Updated this week
- A framework for optimizing DSPy programs with RL ☆91 · Updated this week
- ☆369 · Updated this week
- ☆64 · Updated last month
- PyTorch Single Controller ☆325 · Updated this week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 · Updated 5 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆91 · Updated last month
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations. ☆70 · Updated 5 months ago
- Where GPUs get cooked 👩‍🍳🔥 ☆236 · Updated 4 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆47 · Updated 3 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆110 · Updated last year
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆63 · Updated last month
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks. ☆64 · Updated 5 months ago
- Because it's there. ☆16 · Updated 9 months ago
- ☆228 · Updated last week
- 👷 Build compute kernels ☆77 · Updated this week