[Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers
☆35Nov 25, 2025Updated 3 months ago
Alternatives and similar repositories for DINO-Tok
Users that are interested in DINO-Tok are comparing it to the libraries listed below
Sorting:
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 4 months ago
- A self-hosted cross-platform 3DAIGC software. Working with 3DAIGC algorithms completely deployed locally. Supported 3D workflows include …☆73Jan 18, 2026Updated last month
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆35Feb 15, 2024Updated 2 years ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆55Sep 16, 2025Updated 5 months ago
- ☆11Sep 4, 2022Updated 3 years ago
- Amazon S3 tokenizer☆10Feb 26, 2026Updated last week
- ☆10Sep 17, 2022Updated 3 years ago
- Where is the "main theme" in an orchestral score?☆13Oct 25, 2025Updated 4 months ago
- Weather4Cast 2023 NeurIPS Competition - RainAI☆15Dec 4, 2023Updated 2 years ago
- ☆46Nov 20, 2025Updated 3 months ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆39Feb 7, 2026Updated last month
- ☆22Dec 23, 2025Updated 2 months ago
- Official code of paper: MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly.☆103Feb 24, 2026Updated 2 weeks ago
- PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis☆33Oct 27, 2025Updated 4 months ago
- ☆11Jan 24, 2024Updated 2 years ago
- [AAAI 2025] The official implementation for the "Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation"☆18Jul 18, 2025Updated 7 months ago
- A small library of 3D related utilities used in my research.☆10Mar 5, 2022Updated 4 years ago
- ☆17Jun 24, 2025Updated 8 months ago
- Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views (with visual imitation learning for robots)☆30Updated this week
- ☆12Feb 3, 2026Updated last month
- MobileNetV2: Inverted Residuals and Linear Bottlenecks☆10Jul 22, 2019Updated 6 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)☆11Jun 16, 2022Updated 3 years ago
- Clean singing voice with no accompaniment. Semiprofessional singers. Semiprofessional quality. Songs from classical turkish makam in şark…☆12Mar 7, 2016Updated 10 years ago
- Official implementation of "NoiseAR: AutoRegressing Initial Noise Prior for Diffusion Models"☆18Jun 3, 2025Updated 9 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- Use multiple processes to convert .mobi file to image folders, or convert to JPG compressed zip packages.☆11Jun 5, 2025Updated 9 months ago
- The official codes of Learning to Decouple the Lights for 3D Face Texture Modeling (NeurIPS'24)☆14Mar 17, 2025Updated 11 months ago
- KIRD 위성영상활용교육 - 위성영상의 이해☆15Apr 25, 2023Updated 2 years ago
- Simple Tool Box with Pytorch☆10Jan 27, 2021Updated 5 years ago
- ☆78Nov 4, 2025Updated 4 months ago
- Code for "Generating Part-Aware Editable 3D Shapes without 3D Supervision", CVPR 2023☆49Jun 22, 2024Updated last year
- Code release for: Controllable Layer Decomposition for Reversible Multi-Layer Image Generation☆44Dec 7, 2025Updated 3 months ago
- [CVPR 2026] Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration☆38Feb 25, 2026Updated last week
- ☆11Jun 3, 2023Updated 2 years ago
- transcribe guitar solo audio to midi-like tab.☆12May 18, 2022Updated 3 years ago
- The DJ Mix Dataset☆17Sep 7, 2022Updated 3 years ago
- Perceived Music Quality Dataset☆12Jul 1, 2024Updated last year