determined-ai / determined-examples
Example ML projects that use the Determined library.
☆14 Updated last week
Related projects:
- NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference ☆58 Updated this week
- ☆25 Updated 9 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" ☆57 Updated 5 months ago
- Official implementation of Goldfish Loss: Mitigating Memorization in Generative LLMs ☆68 Updated 2 months ago
- A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem. ☆15 Updated 3 months ago
- Minimum Description Length probing for neural network representations ☆15 Updated 11 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluating ☆26 Updated last week
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 Updated last week
- ☆130 Updated this week
- Utilities for Training Very Large Models ☆56 Updated 2 weeks ago
- ☆40 Updated 2 months ago
- Code for paper: "Privately generating tabular data using language models". ☆14 Updated last year
- ☆47 Updated 3 months ago
- Explain a black-box module in natural language. ☆33 Updated 3 weeks ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆77 Updated last year
- Lightning support for Intel Habana accelerators. ☆25 Updated 2 weeks ago
- ☆30 Updated 3 months ago
- ☆9 Updated 5 months ago
- Here we will test various linear attention designs. ☆55 Updated 4 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆17 Updated 10 months ago
- ☆38 Updated 9 months ago
- Using FlexAttention to compute attention with different masking patterns ☆28 Updated last week
- Implementation of Hyena Hierarchy in JAX ☆10 Updated last year
- Code for the paper "Accessing higher dimensions for unsupervised word translation" ☆19 Updated last year
- A repository for research on medium-sized language models. ☆71 Updated 3 months ago
- Implementation of Spectral State Space Models ☆16 Updated 6 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆51 Updated this week
- ☆29 Updated 2 weeks ago
- Reversal Curse Experiment ☆13 Updated 11 months ago
- ☆29 Updated 3 weeks ago