lucasb-eyer / cnn_vit_benchmarksLinks

☆16

Alternatives and similar repositories for cnn_vit_benchmarks

Users that are interested in cnn_vit_benchmarks are comparing it to the libraries listed below

Sorting:

sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆52Updated last year
andravin / spio
Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
☆43Updated last week
huggingface / chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
☆160Updated last year
ariG23498 / mmdp
☆30Updated 4 months ago
sayakpaul / big_vision_experiments
Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.
☆22Updated 2 years ago
ariG23498 / timm-wrapper-examples
Notebooks to demonstrate TimmWrapper
☆16Updated 10 months ago
apple / ml-mofi
☆59Updated last year
AnonymousAlethiometer / SGD_SaI
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆54Updated 10 months ago
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆117Updated 7 months ago
facebookresearch / mexma
MEXMA: Token-level objectives improve sentence representations
☆42Updated 11 months ago
WalBouss / LeGrad
[ICCV25] Official Implementation of LeGrad
☆83Updated last year
elsevierlabs-os / clip-image-search
Fine-tuning OpenAI CLIP Model for Image Search on medical images
☆77Updated 3 years ago
apoorvkh / torchrunx
Easily run PyTorch on multiple GPUs & machines
☆54Updated 2 weeks ago
GenRobo / MatMamba
Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆61Updated last year
wandb / Hemm
A holistic evaluation library for multi-modal generative models using Weave
☆27Updated last year
mbzuai-oryx / ALM-Bench
[CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…
☆45Updated 6 months ago
ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated last year
visual-layer / visuallayer
Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …
☆69Updated 7 months ago
cloneofsimo / min-fsdp
☆91Updated last year
albanie / foundation-models
Video descriptions of research papers relating to foundation models and scaling
☆30Updated 2 years ago
SriramB-98 / vit-decompose
☆23Updated 11 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
multimodal-interpretability / maia
Official implementation of MAIA, A Multimodal Automated Interpretability Agent
☆99Updated last month
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132Updated last year
surkovv / sdxl-unbox
Sparse Autoencoders for Stable Diffusion XL models.
☆78Updated last month
penfever / wildchat-50m
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆31Updated 8 months ago
marqo-ai / GCL
Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…
☆68Updated 2 months ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆23Updated last year
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆148Updated 2 months ago
shan18 / Perceiver-Resampler-XAttn-Captioning
Generating Captions via Perceiver-Resampler Cross-Attention Networks
☆17Updated 2 years ago