wandb / Hemm
A holistic evaluation library for multi-modal generative models using Weave
☆26 Updated last week
Related projects
Alternatives and complementary repositories for Hemm
- ☆39 Updated 9 months ago
- Scalable neural net training via automatic normalization in the modular norm. ☆119 Updated 2 months ago
- ☆72 Updated 4 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆112 Updated 6 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆46 Updated last week
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆29 Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆84 Updated last week
- Code for training & evaluating Contextual Document Embedding models ☆93 Updated this week
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆12 Updated 3 weeks ago
- ☆76 Updated 6 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… ☆47 Updated last week
- ☆100 Updated 3 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆93 Updated last week
- WIP ☆89 Updated 2 months ago
- ☆115 Updated 2 weeks ago
- Notebooks for fine-tuning PaliGemma ☆41 Updated 3 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆84 Updated 2 months ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆80 Updated 10 months ago
- ☆40 Updated this week
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers ☆58 Updated 3 months ago
- Mobile Viewer for W&B, built on top of Flutter. ☆30 Updated 8 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆151 Updated 7 months ago
- ☆49 Updated 7 months ago
- ☆40 Updated 7 months ago
- ☆46 Updated last month
- A basic pure PyTorch implementation of flash attention ☆15 Updated 2 weeks ago
- ☆24 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago
- Train vision models using JAX and 🤗 transformers ☆95 Updated 3 weeks ago