determined-ai / determined-examplesLinks
Example ML projects that use the Determined library.
☆32Updated 9 months ago
Alternatives and similar repositories for determined-examples
Users that are interested in determined-examples are comparing it to the libraries listed below
Sorting:
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆59Updated 8 months ago
- Simple implementation of Speculative Sampling in NumPy for GPT-2.☆95Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆130Updated this week
- ☆15Updated 2 months ago
- Utilities for Training Very Large Models☆58Updated 9 months ago
- Various transformers for FSDP research☆37Updated 2 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆66Updated 6 months ago
- ☆50Updated last year
- LM engine is a library for pretraining/finetuning LLMs☆57Updated this week
- ☆34Updated last month
- ☆72Updated 3 months ago
- FlexAttention w/ FlashAttention3 Support☆26Updated 8 months ago
- ☆17Updated 2 years ago
- ☆126Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- ☆74Updated 7 months ago
- MLPerf™ logging library☆36Updated 2 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆32Updated this week
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆116Updated 6 months ago
- Easy and Efficient Quantization for Transformers☆199Updated 4 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆76Updated last year
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆70Updated last year
- Personal solutions to the Triton Puzzles☆19Updated 11 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Updated last year
- ☆37Updated this week
- A bunch of kernels that might make stuff slower 😉☆51Updated this week
- Code for studying the super weight in LLM☆107Updated 6 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- Benchmarks to capture important workloads.☆31Updated 4 months ago
- A parallel framework for training deep neural networks☆61Updated 3 months ago