lucasb-eyer / cnn_vit_benchmarks
☆16Updated last year
Alternatives and similar repositories for cnn_vit_benchmarks
Users who are interested in cnn_vit_benchmarks are comparing it to the libraries listed below
- ☆59Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- ☆33Updated 5 months ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆46Updated last week
- Fine-tuning OpenAI CLIP Model for Image Search on medical images☆77Updated 3 years ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 7 months ago
- Notebooks for fine-tuning PaliGemma☆117Updated 8 months ago
- ☆65Updated 2 years ago
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆248Updated 11 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆69Updated this week
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- Easily run PyTorch on multiple GPUs & machines☆56Updated 3 weeks ago
- Notebooks to demonstrate TimmWrapper☆16Updated 11 months ago
- ☆92Updated last year
- A holistic evaluation library for multi-modal generative models using Weave☆27Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆54Updated 11 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆109Updated last year
- ☆87Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 4 months ago
- Train vision models using JAX and 🤗 transformers☆100Updated 2 weeks ago
- Focused on fast experimentation and simplicity☆76Updated last year
- Simple Python template☆42Updated last year
- Timm model explorer☆42Updated last year
- [ICCV25] Official Implementation of LeGrad☆83Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆41Updated 8 months ago