philippe-eecs/vitok

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/philippe-eecs/vitok)

philippe-eecs / vitok

☆34

Alternatives and similar repositories for vitok

Users that are interested in vitok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stanford-iris-lab / segmenting_feats
View on GitHub
☆13Nov 1, 2023Updated 2 years ago
philippe-eecs / small-vision
View on GitHub
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
☆34Jun 26, 2024Updated 2 years ago
shlokk / object-cropping-ssl
View on GitHub
This repo contains the code for the paper "Object-cropping for SSL".
☆18Feb 14, 2023Updated 3 years ago
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195Feb 24, 2026Updated 4 months ago
cloneofsimo / efae
View on GitHub
☆24Jun 18, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CVMI-Lab / SlotMIM
View on GitHub
(CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
☆25Mar 11, 2025Updated last year
CLAIRE-Labo / flash_attention
View on GitHub
A basic pure pytorch implementation of flash attention
☆17Oct 28, 2024Updated last year
csuhan / Tar
View on GitHub
[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆202Sep 18, 2025Updated 10 months ago
ThomasMrY / VCT
View on GitHub
[NeurIPS 2022] code for "Visual Concepts Tokenization"
☆23Oct 10, 2022Updated 3 years ago
google-deepmind / detcon
View on GitHub
☆62Oct 29, 2022Updated 3 years ago
AV-Odyssey / AV-Odyssey
View on GitHub
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31Dec 23, 2024Updated last year
zelaki / eqvae
View on GitHub
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆181Mar 18, 2026Updated 4 months ago
liyz15 / Aligning-Latent-Spaces-with-Flow-Priors
View on GitHub
☆43Jun 6, 2025Updated last year
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
wusize / Harmon
View on GitHub
[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
☆191May 21, 2025Updated last year
facebookresearch / metamorph
View on GitHub
Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning
☆235Jan 22, 2026Updated 5 months ago
MKJia / MGVQ
View on GitHub
[Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
☆55Sep 16, 2025Updated 10 months ago
RWKV / RWKV-block
View on GitHub
PyTorch implementation of RWKV blocks
☆31Jul 22, 2025Updated 11 months ago
wlin-at / MAXI
View on GitHub
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆31Sep 5, 2023Updated 2 years ago
mihirp1998 / Slot-TTA
View on GitHub
Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.
☆26Jun 20, 2023Updated 3 years ago
sgugger / torchdynamo-tests
View on GitHub
☆20Nov 23, 2022Updated 3 years ago
edouardoyallon / acco
View on GitHub
ACCO: An optimization algorithm for sharded distributed LLM training.
☆13May 22, 2025Updated last year
FouierL / EquS
View on GitHub
[WACV 2026]Official Code of the paper “Equivariant Sampling for Improving Diffusion Model-based Image Restoration“
☆19Jan 29, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
nisten / grokadamw
View on GitHub
new optimizer
☆20Aug 4, 2024Updated last year
Share14 / ShareGemini
View on GitHub
☆32Jul 29, 2024Updated last year
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
facebookresearch / webssl
View on GitHub
Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).
☆214Mar 20, 2026Updated 4 months ago
svi-diffusion / codes
View on GitHub
Official repository for "Solving Video Inverse Problems Using Image Diffusion Models"
☆11Mar 7, 2026Updated 4 months ago
Zyphra / tree_attention
View on GitHub
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆135Dec 3, 2024Updated last year
hankook / CLEL
View on GitHub
☆17Mar 2, 2023Updated 3 years ago
facebookresearch / metaquery
View on GitHub
Official Implementation of Paper Transfer between Modalities with MetaQueries
☆324Oct 12, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
DTennant / distill_visual_priors
View on GitHub
2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261
☆13Aug 22, 2021Updated 4 years ago
yitong91 / Multiple-Domain-Matching-Network
View on GitHub
Extracting Relationships by Multi-Domain Matching
☆11Mar 21, 2019Updated 7 years ago
epfml / schedules-and-scaling
View on GitHub
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93Oct 30, 2024Updated last year
ys-zong / MIRB
View on GitHub
Benchmarking Multi-Image Understanding in Vision and Language Models
☆11Jul 29, 2024Updated last year
yinboc / dito
View on GitHub
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆169Jan 31, 2025Updated last year
francois-rozet / lola
View on GitHub
Official implementation of "Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation"
☆65Mar 19, 2026Updated 4 months ago
sherbret / normalization_equivariant_nn
View on GitHub
Official implementation of the paper "Normalization-Equivariant Neural Networks with Application to Image Denoising" (NeurIPS'23)
☆16Jun 27, 2025Updated last year