ahmadmustafaanis / C4AI-Scholars-Challenge
☆12Updated last year
Alternatives and similar repositories for C4AI-Scholars-Challenge:
Users that are interested in C4AI-Scholars-Challenge are comparing it to the libraries listed below
- Easily run PyTorch on multiple GPUs & machines☆45Updated 3 weeks ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆54Updated 2 years ago
- ☆76Updated 9 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Building GPT ...☆17Updated 4 months ago
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFace☆41Updated 4 months ago
- ☆21Updated 3 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆172Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- A basic pure pytorch implementation of flash attention☆16Updated 5 months ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆49Updated last year
- ML/DL Math and Method notes☆60Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆123Updated 11 months ago
- ☆37Updated last year
- WIP☆93Updated 7 months ago
- ☆13Updated 3 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆63Updated 6 months ago
- ☆53Updated last year
- Highly commented implementations of Transformers in PyTorch☆135Updated last year
- Implementation of the proposed Spline-Based Transformer from Disney Research☆88Updated 5 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆157Updated last year
- ☆44Updated 2 months ago
- ☆67Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- A holistic evaluation library for multi-modal generative models using Weave☆28Updated 5 months ago
- Memory-efficient transformer. Work in progress.☆19Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 10 months ago