lucasb-eyer / cnn_vit_benchmarksLinks
☆16Updated last year
Alternatives and similar repositories for cnn_vit_benchmarks
Users that are interested in cnn_vit_benchmarks are comparing it to the libraries listed below
Sorting:
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆43Updated last week
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- ☆30Updated 4 months ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Notebooks to demonstrate TimmWrapper☆16Updated 10 months ago
- ☆59Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆54Updated 10 months ago
- Notebooks for fine tuning pali gemma☆117Updated 7 months ago
- MEXMA: Token-level objectives improve sentence representations☆42Updated 11 months ago
- [ICCV25] Official Implementation of LeGrad☆83Updated last year
- Fine-tuning OpenAI CLIP Model for Image Search on medical images☆77Updated 3 years ago
- Easily run PyTorch on multiple GPUs & machines☆54Updated 2 weeks ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated last year
- A holistic evaluation library for multi-modal generative models using Weave☆27Updated last year
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆45Updated 6 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 7 months ago
- ☆91Updated last year
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- ☆23Updated 11 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆99Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Sparse Autoencoders for Stable Diffusion XL models.☆78Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆31Updated 8 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆68Updated 2 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆148Updated 2 months ago
- Generating Captions via Perceiver-Resampler Cross-Attention Networks☆17Updated 2 years ago