tensorwavecloud / ScalarLMLinks
ScalarLM - a unified training and inference stack
☆94Updated last month
Alternatives and similar repositories for ScalarLM
Users that are interested in ScalarLM are comparing it to the libraries listed below
Sorting:
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆151Updated 3 months ago
- ☆68Updated 7 months ago
- look how they massacred my boy☆63Updated last year
- ☆214Updated 2 weeks ago
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 3 months ago
- Verbosity control for AI agents☆65Updated last year
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆133Updated this week
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆175Updated 8 months ago
- Cray-LM unified training and inference stack.☆22Updated 11 months ago
- ☆40Updated last year
- Storing long contexts in tiny caches with self-study☆228Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 8 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆100Updated 5 months ago
- ☆62Updated 5 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Vector Database with support for late interaction and token level embeddings.☆54Updated 6 months ago
- Routing on Random Forest (RoRF)☆237Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆132Updated 9 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆250Updated this week
- ☆90Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 4 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆87Updated last week
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- ☆53Updated 10 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆19Updated 11 months ago
- SIMD quantization kernels☆93Updated 4 months ago