lucasb-eyer / cnn_vit_benchmarksLinks
☆15Updated last year
Alternatives and similar repositories for cnn_vit_benchmarks
Users that are interested in cnn_vit_benchmarks are comparing it to the libraries listed below
Sorting:
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆42Updated 6 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 7 months ago
- ☆27Updated last month
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆159Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- ☆59Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 9 months ago
- A holistic evaluation library for multi-modal generative models using Weave☆28Updated 10 months ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 3 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- Notebooks for fine tuning pali gemma☆112Updated 4 months ago
- Code for "On Measuring Faithfulness of Natural Language Explanations"☆20Updated last year
- Easily run PyTorch on multiple GPUs & machines☆46Updated 2 months ago
- ☆23Updated 7 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆103Updated last year
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆64Updated 4 months ago
- Video descriptions of research papers relating to foundation models and scaling☆31Updated 2 years ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images☆77Updated 3 years ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated 9 months ago
- Load any clip model with a standardized interface☆22Updated last week
- ☆65Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆131Updated last year
- Sparse Autoencoders for Stable Diffusion XL models.☆69Updated last month
- MatFormer repo☆62Updated 8 months ago
- Train vision models using JAX and 🤗 transformers☆99Updated this week
- Timm model explorer☆41Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆82Updated 8 months ago
- Generating Captions via Perceiver-Resampler Cross-Attention Networks☆17Updated 2 years ago
- [ICCV25] Official Implementation of LeGrad☆78Updated 10 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆86Updated 2 months ago