Ma-Lab-Berkeley/CRATE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ma-Lab-Berkeley/CRATE)

Ma-Lab-Berkeley / CRATE

Code for CRATE (Coding RAte reduction TransformEr).

☆1,275

Alternatives and similar repositories for CRATE

Users that are interested in CRATE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UCSC-VLAA / CRATE-alpha
View on GitHub
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
☆47Jun 3, 2024Updated 2 years ago
Ma-Lab-Berkeley / ReduNet
View on GitHub
ReduNet
☆541Feb 17, 2022Updated 4 years ago
Ma-Lab-Berkeley / MCR2
View on GitHub
☆86May 24, 2021Updated 5 years ago
Delay-Xili / LDR
View on GitHub
The official PyTorch implementation of the paper: Xili Dai, Shengbang Tong, et al. "Closed-Loop Data Transcription to an LDR via Minimaxi…
☆64Nov 3, 2022Updated 3 years ago
tsb0601 / EMP-SSL
View on GitHub
This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch."
☆229Aug 21, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
state-spaces / mamba
View on GitHub
Mamba SSM architecture
☆18,675Jul 22, 2026Updated last week
ryanchankh / mcr2
View on GitHub
Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)
☆205Dec 8, 2022Updated 3 years ago
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,693May 31, 2024Updated 2 years ago
Delay-Xili / uCTRL
View on GitHub
☆15Apr 6, 2023Updated 3 years ago
zengyi-li / NMCE-release
View on GitHub
Code for Neural Manifold Clustering and Embedding
☆63Mar 11, 2022Updated 4 years ago
ryanchankh / redunet_paper
View on GitHub
Official NumPy Implementation of Deep Networks from the Principle of Rate Reduction (2021)
☆60Apr 24, 2021Updated 5 years ago
LeslieTrue / CPP
View on GitHub
This is the official implementation for Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models.
☆32Jun 16, 2023Updated 3 years ago
Delay-Xili / SDNet
View on GitHub
An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition"
☆124Apr 1, 2023Updated 3 years ago
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,165Jun 3, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,559Updated this week
openai / consistency_models
View on GitHub
Official repo for consistency models.
☆6,492Mar 22, 2024Updated 2 years ago
cambrian-mllm / cambrian
View on GitHub
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
☆2,009Nov 7, 2025Updated 8 months ago
haotian-liu / LLaVA
View on GitHub
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,950Aug 12, 2024Updated last year
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,172Jan 23, 2026Updated 6 months ago
FoundationVision / VAR
View on GitHub
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,714Nov 10, 2025Updated 8 months ago
ytongbai / LVM
View on GitHub
☆1,836Jun 28, 2024Updated 2 years ago
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,083Mar 25, 2026Updated 4 months ago
facebookresearch / MetaCLIP
View on GitHub
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
☆1,849Nov 27, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
invictus717 / MetaTransformer
View on GitHub
Meta-Transformer for Unified Multimodal Learning
☆1,649Dec 5, 2023Updated 2 years ago
deepspeedai / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆42,827Updated this week
facebookresearch / ImageBind
View on GitHub
ImageBind One Embedding Space to Bind Them All
☆9,060Nov 21, 2025Updated 8 months ago
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,377Jul 23, 2024Updated 2 years ago
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,614Sep 18, 2024Updated last year
baaivision / Emu
View on GitHub
Emu Series: Generative Multimodal Models from BAAI
☆1,776Jan 12, 2026Updated 6 months ago
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,444Jun 22, 2026Updated last month
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,022Updated this week
BradyFU / Awesome-Multimodal-Large-Language-Models
View on GitHub
Latest Advances on Multimodal Large Language Models
☆17,958Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BlinkDL / RWKV-LM
View on GitHub
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,641Updated this week
DruvPai / DiffusionLab
View on GitHub
Easy no-frills Jax implementations of common abstractions for simple diffusion models.
☆11Feb 23, 2026Updated 5 months ago
huggingface / peft
View on GitHub
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,460Updated this week
microsoft / LoRA
View on GitHub
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆13,692Dec 17, 2024Updated last year
baaivision / Emu3
View on GitHub
Next-Token Prediction is All You Need
☆2,432Jan 12, 2026Updated 6 months ago
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations for emerging model architectures
☆5,463Updated this week
OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,916Mar 14, 2024Updated 2 years ago