tensorwavecloud / ScalarLMLinks
ScalarLM - a unified training and inference stack
☆94Updated 3 weeks ago
Alternatives and similar repositories for ScalarLM
Users that are interested in ScalarLM are comparing it to the libraries listed below
Sorting:
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆53Updated 3 months ago
- look how they massacred my boy☆63Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆56Updated 2 months ago
- ☆68Updated 6 months ago
- Cray-LM unified training and inference stack.☆22Updated 10 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆148Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated last month
- ☆40Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆304Updated last week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆280Updated last month
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 4 months ago
- Storing long contexts in tiny caches with self-study☆218Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Verbosity control for AI agents☆64Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- SIMD quantization kernels☆93Updated 3 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆270Updated this week
- ☆62Updated 5 months ago
- ☆213Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- ☆68Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆143Updated 8 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆132Updated 8 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆173Updated 7 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆67Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆239Updated this week
- Train, tune, and infer Bamba model☆137Updated 6 months ago
- ☆31Updated last year
- ☆219Updated 10 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆26Updated 6 months ago