NVlabs / PS3
Scaling Vision Pre-Training to 4K Resolution
☆202 · Updated 2 weeks ago
Alternatives and similar repositories for PS3
Users who are interested in PS3 are comparing it to the repositories listed below.
- Official Implementation for our NeurIPS 2024 paper, "Don't Look Twice: Run-Length Tokenization for Faster Video Transformers". ☆224 · Updated 5 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL). ☆182 · Updated 4 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation ☆86 · Updated 6 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆171 · Updated 5 months ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention ☆172 · Updated 6 months ago
- An open source implementation of CLIP (With TULIP Support) ☆162 · Updated 4 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆157 · Updated 5 months ago
- PyTorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆211 · Updated 5 months ago
- ☆190 · Updated 3 months ago
- [ICML 2025] Official Implementation for SimDINO/SimDINOv2