wandb / Hemm
A holistic evaluation library for multi-modal generative models using Weave
☆28 Updated 6 months ago
Alternatives and similar repositories for Hemm
Users who are interested in Hemm are comparing it to the libraries listed below.
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. https://arxiv.org/abs/2411.049… ☆31 Updated this week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆60 Updated 6 months ago
- Weights & Biases Addons is a repository consisting of additional utilities and community contributions for supercharging your Weights &… ☆23 Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆40 Updated 3 weeks ago
- Set of scripts to finetune LLMs ☆37 Updated last year
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training ☆123 Updated last year
- A template to kick-start your Python project ☆51 Updated 4 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆41 Updated last month
- A miniature AI training framework for PyTorch ☆42 Updated 3 months ago
- Drift detection module for machine learning pipelines. ☆25 Updated last year
- PyTorch library for Active Fine-Tuning ☆72 Updated 2 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆157 Updated last year
- Mobile Viewer for W&B, built on top of Flutter. ☆34 Updated last year
- An implementation of the PSGD Kron second-order optimizer for PyTorch ☆89 Updated last month
- Framework for building and maintaining self-updating prompts for LLMs ☆62 Updated 11 months ago
- Supporting PyTorch FSDP for optimizers ☆80 Updated 5 months ago
- Tiny re-implementation of MDM in the style of LLaDA and the nano-gpt speedrun ☆49 Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆52 Updated 3 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 Updated last year
- Sphynx Hallucination Induction ☆54 Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 8 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆82 Updated last year