ahmadmustafaanis / C4AI-Scholars-Challenge
☆12Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for C4AI-Scholars-Challenge
- HomebrewNLP in JAX flavour for maintable TPU-Training☆46Updated 10 months ago
- 🧰 The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! ✨☆11Updated this week
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆52Updated last year
- Building GPT ...☆17Updated 3 months ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated last year
- ☆35Updated 7 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆80Updated 11 months ago
- ☆19Updated 7 months ago
- ML/DL Math and Method notes☆57Updated 11 months ago
- All about the fundamentals and working of Diffusion Models☆152Updated last year
- ☆73Updated 4 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 6 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆95Updated 3 weeks ago
- A holistic evaluation library for multi-modal generative models using Weave☆27Updated 3 weeks ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆118Updated 3 weeks ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 2 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- deep learning with pytorch lightning☆0Updated 3 weeks ago
- Textbook on reinforcement learning from human feedback☆76Updated 3 weeks ago
- A basic pure pytorch implementation of flash attention☆16Updated 3 weeks ago
- Automatically take good care of your preemptible TPUs☆32Updated last year
- Yet another mini autodiff system for educational purposes☆27Updated last week
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆34Updated 8 months ago
- ☆48Updated last week
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 5 months ago
- Cyclemoid implementation for PyTorch☆87Updated 2 years ago
- ☆73Updated 2 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 3 years ago