lucasb-eyer / cnn_vit_benchmarksLinks
☆15Updated last year
Alternatives and similar repositories for cnn_vit_benchmarks
Users that are interested in cnn_vit_benchmarks are comparing it to the libraries listed below
Sorting:
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆42Updated 5 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 6 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 8 months ago
- ☆59Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆158Updated last year
- ☆27Updated 3 weeks ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆70Updated 3 months ago
- Notebooks for fine tuning pali gemma☆112Updated 3 months ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- Timm model explorer☆41Updated last year
- Sparse Autoencoders for Stable Diffusion XL models.☆69Updated last week
- Easily run PyTorch on multiple GPUs & machines☆46Updated last month
- Focused on fast experimentation and simplicity☆76Updated 7 months ago
- ☆78Updated 9 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 9 months ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆52Updated 6 months ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images☆76Updated 3 years ago
- A holistic evaluation library for multi-modal generative models using Weave☆28Updated 9 months ago
- ☆83Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆82Updated 7 months ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year
- ☆22Updated 7 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆94Updated 7 months ago
- Train vision models using JAX and 🤗 transformers☆98Updated this week
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆88Updated 2 months ago
- MatFormer repo☆59Updated 7 months ago
- Autoregressive Image Generation☆32Updated last month
- ☆86Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆139Updated last year