tensorwavecloud / ScalarLM
ScalarLM - a unified training and inference stack
☆40 Updated last month
Alternatives and similar repositories for ScalarLM
Users that are interested in ScalarLM are comparing it to the libraries listed below
- A Learning Journey: Micrograd in Mojo 🔥 ☆61 Updated 8 months ago
- ☆89 Updated 8 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations. ☆70 Updated 4 months ago
- SIMD quantization kernels ☆72 Updated this week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆95 Updated last month
- Cray-LM unified training and inference stack. ☆22 Updated 4 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆47 Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆46 Updated this week
- ☆19 Updated 10 months ago
- train with kittens! ☆60 Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 4 months ago
- ☆38 Updated 11 months ago
- A miniature version of Modal ☆20 Updated last year
- A reading list of relevant papers and projects on foundation model annotation ☆27 Updated 4 months ago
- Because it's there. ☆16 Updated 9 months ago
- look how they massacred my boy ☆63 Updated 8 months ago
- GPU documentation for humans ☆70 Updated last week
- Verbosity control for AI agents ☆63 Updated last year
- lossily compress representation vectors using product quantization ☆57 Updated 2 months ago
- ☆19 Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆71 Updated this week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆30 Updated 9 months ago
- ☆63 Updated last month
- High-Performance Engine for Multi-Vector Search ☆106 Updated 3 weeks ago
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ☆66 Updated 3 months ago
- An introduction to LLM Sampling ☆78 Updated 6 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆131 Updated last month
- ☆50 Updated 2 months ago
- ☆183 Updated this week
- Modded vLLM to run pipeline parallelism over public networks ☆37 Updated last month