facebookresearch/pixio

[CVPR 2026] Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction

☆458

Alternatives and similar repositories for pixio

Users who are interested in pixio are comparing it to the repositories listed below.

QitaoZhao / E-RayZer
[CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.
☆301 · May 30, 2026 · Updated last month
facebookresearch / tuna-2
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆739 · Updated this week
Robbyant / lingbot-vision
Self-supervised learning for spatial perception
☆868 · Jul 8, 2026 · Updated 2 weeks ago
ZitengWangNYU / Scale-RAE
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255 · Feb 13, 2026 · Updated 5 months ago
SihanXU / nepa
PyTorch implementation of NEPA
☆338 · Feb 9, 2026 · Updated 5 months ago
hwjiang1510 / RayZer
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
☆444 · Nov 24, 2025 · Updated 8 months ago
liruilong940607 / prope
Cameras as Relative Positional Encoding
☆742 · Dec 18, 2025 · Updated 7 months ago
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,329 · Apr 13, 2026 · Updated 3 months ago
bytetriper / RAE
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,978 · Feb 25, 2026 · Updated 5 months ago
facebookresearch / dinov3
Reference PyTorch implementation and models for DINOv3
☆11,010 · Jul 15, 2026 · Updated last week
henry123-boy / SpaTrackerV2
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984 · Feb 27, 2026 · Updated 4 months ago
MCG-NJU / PixNerd
[ICLR 2026] PixNerd: Pixel Neural Field Diffusion
☆183 · Dec 10, 2025 · Updated 7 months ago
yyfz / Pi3
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,091 · Jul 3, 2026 · Updated 3 weeks ago
yangzhou24 / OmniWorld
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485 · Apr 16, 2026 · Updated 3 months ago
End2End-Diffusion / iREPA
[ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?
☆258 · Dec 15, 2025 · Updated 7 months ago
LTH14 / JiT
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
☆2,467 · Dec 8, 2025 · Updated 7 months ago
ethz-vlg / mvtracker
[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking
☆512 · Nov 3, 2025 · Updated 8 months ago
Any-4D / Any4D
Any4D: Unified Feed-Forward Metric 4D Reconstruction
☆387 · Apr 17, 2026 · Updated 3 months ago
facebookresearch / map-anything
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,591 · Jul 17, 2026 · Updated last week
NVlabs / RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,902 · May 29, 2026 · Updated last month
cvlab-kaist / GLD
Official implementation of "Repurposing Geometric Foundation Models for Multi-view Diffusion"
☆235 · Updated this week
wimmerth / anyup
[ICLR '26 Oral] Official repository of the paper "AnyUp: Universal Feature Upsampling".
☆569 · Apr 17, 2026 · Updated 3 months ago
facebookresearch / 4DGT
[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"
☆469 · Sep 19, 2025 · Updated 10 months ago
JaceyHuang / Gen3R
[CVPR 2026] Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
☆366 · Mar 20, 2026 · Updated 4 months ago
andrehuang / loftup
[ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"
☆261 · Jan 13, 2026 · Updated 6 months ago
CUT3R / CUT3R
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,468 · Aug 27, 2025 · Updated 10 months ago
wzzheng / StreamVGGT
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆944 · Oct 27, 2025 · Updated 8 months ago
Inception3D / TTT3R
[ICLR 2026] A simple state update rule to enhance length generalization for CUT3R
☆710 · May 11, 2026 · Updated 2 months ago
Self-Evo / SelfEvo
Self-Improving 4D Perception via Self-Distillation
☆72 · Apr 10, 2026 · Updated 3 months ago
nv-tlabs / vipe
ViPE: Video Pose Engine for Geometric 3D Perception
☆2,051 · Jun 9, 2026 · Updated last month
Yangr116 / VST
[ECCV2026] Visual Spatial Tuning
☆200 · Mar 25, 2026 · Updated 4 months ago
gangweix / pixel-perfect-depth
[NeurIPS 2025] Pixel-Perfect Depth
☆1,059 · Feb 13, 2026 · Updated 5 months ago
ByteDance-Seed / TraceAnything
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆543 · Oct 31, 2025 · Updated 8 months ago
facebookresearch / lagernvs
Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)
☆403 · Jun 26, 2026 · Updated last month
ant-research / FLARE
☆721 · May 1, 2025 · Updated last year
nanovisionx / RAEv2
Official Implementation for RAEv2: Improved Baselines with Representation Autoencoders
☆310 · May 21, 2026 · Updated 2 months ago
NVlabs / AnyFlow
Flow Map OPD for AnyStep Video Diffusion
☆399 · May 23, 2026 · Updated 2 months ago
alansong1322 / VECA
Elastic Attention Cores for Scalable Vision Transformers
☆15 · May 13, 2026 · Updated 2 months ago
facebookresearch / vggt-omega
[CVPR 2026 Oral] VGGT Omega
☆3,692 · Jul 15, 2026 · Updated last week