apple / ml-cross-entropy
☆550 · Updated last month
Alternatives and similar repositories for ml-cross-entropy
Users who are interested in ml-cross-entropy are comparing it to the libraries listed below.
- Implementation of Ring Attention, from Liu et al. at Berkeley AI, in PyTorch ☆544 · Updated 6 months ago
- Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆272 · Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆356 · Updated 11 months ago
- Load compute kernels from the Hub ☆327 · Updated last week
- Helpful tools and examples for working with flex-attention ☆1,059 · Updated this week
- ☆907 · Updated 2 weeks ago
- Large Context Attention ☆752 · Updated last month
- Efficient LLM Inference over Long Sequences