determined-ai / determined-examples
Example ML projects that use the Determined library.
☆29Updated 6 months ago
Alternatives and similar repositories for determined-examples:
Users that are interested in determined-examples are comparing it to the libraries listed below
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆59Updated 5 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆64Updated 3 months ago
- Dolomite Engine is a library for pretraining/finetuning LLMs☆44Updated this week
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆110Updated 3 months ago
- ☆62Updated last month
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆51Updated last week
- Example of applying CUDA graphs to LLaMA-v2☆12Updated last year
- Benchmarking PyTorch 2.0 different models☆21Updated 2 years ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- Train, tune, and infer Bamba model☆86Updated 2 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆79Updated 2 weeks ago
- ☆69Updated 4 months ago
- ☆14Updated last month
- A safetensors extension to efficiently store sparse quantized tensors on disk☆91Updated this week
- ☆12Updated 3 weeks ago
- A minimal implementation of vllm.☆36Updated 8 months ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆71Updated 6 months ago
- Cascade Speculative Drafting☆29Updated 11 months ago
- A repository for research on medium sized language models.☆76Updated 10 months ago
- Utilities for Training Very Large Models☆58Updated 6 months ago
- Linear Attention Sequence Parallelism (LASP)☆79Updated 9 months ago
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆58Updated 2 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆71Updated 11 months ago
- Hydragen: High-Throughput LLM Inference with Shared Prefixes☆35Updated 10 months ago
- ☆48Updated last year
- Explore training for quantized models☆17Updated 2 months ago
- ☆66Updated last week
- PyTorch bindings for CUTLASS grouped GEMM.☆74Updated 4 months ago
- QuIP quantization☆52Updated last year
- Odysseus: Playground of LLM Sequence Parallelism☆66Updated 9 months ago