(ICCV 2025) "Principal Components" Enable A New Language of Images
☆84 · Jul 28, 2025 · Updated 9 months ago
Alternatives and similar repositories for semanticist
Users that are interested in semanticist are comparing it to the libraries listed below.
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights ☆27 · Oct 28, 2024 · Updated last year
- This repo contains the code for 1D tokenizer and generator ☆1,145 · Mar 20, 2025 · Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆315 · Jun 2, 2025 · Updated 11 months ago
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers. ☆25 · Sep 23, 2024 · Updated last year
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning ☆24 · Mar 11, 2025 · Updated last year
- Home Made Diffusion Models ☆196 · Dec 9, 2025 · Updated 4 months ago
- [ICCV 25] SpectralAR: Spectral Autoregressive Visual Generation ☆36 · Jun 13, 2025 · Updated 10 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆180 · Mar 18, 2026 · Updated last month
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ? ☆149 · Feb 11, 2025 · Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ☆186 · Feb 24, 2026 · Updated 2 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. ☆15 · May 26, 2025 · Updated 11 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language? ☆20 · Mar 28, 2024 · Updated 2 years ago
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?" ☆149 · Jun 13, 2024 · Updated last year
- ☆16 · Mar 24, 2025 · Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆203 · Jan 7, 2026 · Updated 3 months ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261 ☆13 · Aug 22, 2021 · Updated 4 years ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation" ☆429 · Jun 20, 2025 · Updated 10 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models ☆23 · Jul 10, 2025 · Updated 9 months ago
- Official implementation of SimFlow ☆31 · Dec 16, 2025 · Updated 4 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆249 · Oct 12, 2025 · Updated 6 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices ☆116 · Nov 30, 2025 · Updated 5 months ago
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago
- ☆312 · May 29, 2025 · Updated 11 months ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021) ☆12 · Aug 20, 2023 · Updated 2 years ago
- ☆64 · Jul 11, 2025 · Updated 9 months ago
- Recipe for training fully-featured self supervised image jepa models ☆13 · Jun 4, 2025 · Updated 10 months ago
- ☆34 · May 14, 2025 · Updated 11 months ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)" ☆15 · Feb 13, 2023 · Updated 3 years ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data. ☆34 · Apr 6, 2026 · Updated 3 weeks ago
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08… ☆40 · Jun 4, 2025 · Updated 10 months ago
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559) ☆19 · Jul 1, 2025 · Updated 10 months ago
- [ICCV 2025 Findings Oral] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting ☆39 · Nov 20, 2025 · Updated 5 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?" ☆12 · Apr 12, 2026 · Updated 3 weeks ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis" ☆14 · Jun 24, 2023 · Updated 2 years ago
- Exploring Representation-Aligned Latent Space for Better Generation ☆19 · Mar 17, 2026 · Updated last month
- ☆14 · Jan 22, 2025 · Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition". ☆31 · Nov 16, 2022 · Updated 3 years ago
- [CVPR 2026] DDT: Decoupled Diffusion Transformer ☆387 · Aug 22, 2025 · Updated 8 months ago
- Pixel Propagation for unsupervised visual representation learning ☆11 · Feb 16, 2021 · Updated 5 years ago