ahmadmustafaanis / C4AI-Scholars-ChallengeLinks
☆12Updated last year
Alternatives and similar repositories for C4AI-Scholars-Challenge
Users that are interested in C4AI-Scholars-Challenge are comparing it to the libraries listed below
Sorting:
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆33Updated 7 months ago
- Easily run PyTorch on multiple GPUs & machines☆46Updated 2 months ago
- A basic pure pytorch implementation of flash attention☆16Updated 7 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆82Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆126Updated 7 months ago
- Unity Machine Learning Agents Toolkit☆48Updated 2 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Building GPT ...☆17Updated 6 months ago
- ☆37Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆53Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- ☆124Updated 7 months ago
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- ML/DL Math and Method notes☆61Updated last year
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- ☆20Updated last year
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- ☆13Updated last week
- FID computation in Jax/Flax.☆27Updated 10 months ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- WIP☆93Updated 9 months ago
- ☆14Updated last year
- Collection of autoregressive model implementation☆85Updated last month
- ☆51Updated 11 months ago
- ☆78Updated 11 months ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year