google-deepmind/tips

TIPSv2 (CVPR'26) and TIPS (ICLR'25)

☆572

Alternatives and similar repositories for tips

Users who are interested in tips compare it to the repositories listed below.

facebookresearch / EUPE
Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized…
☆685 · Apr 14, 2026 · Updated 3 months ago
wimmerth / anyup
[ICLR '26 Oral] Official repository of the paper "AnyUp: Universal Feature Upsampling".
☆569 · Apr 17, 2026 · Updated 3 months ago
visinf / INSID3
[CVPR 2026 Oral] "INSID3: Training-Free In-Context Segmentation with DINOv3"
☆677 · Jun 26, 2026 · Updated 3 weeks ago
NVlabs / RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,897 · May 29, 2026 · Updated last month
manugaurdl / SteerViT
SteerViT is a framework that equips any ViT with the ability to steer both its global and local visual representations with natural langu…
☆111 · Jun 13, 2026 · Updated last month
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,324 · Apr 13, 2026 · Updated 3 months ago
Robbyant / lingbot-vision
Self-supervised learning for spatial perception
☆832 · Jul 8, 2026 · Updated last week
facebookresearch / metadepth
Efficient image to 3D geometry foundation models from Meta Reality Labs for monocular depth, point maps, and surface normals. Featuring H…
☆60 · May 20, 2026 · Updated 2 months ago
facebookresearch / dinov3
Reference PyTorch implementation and models for DINOv3
☆10,973 · Updated this week
facebookresearch / tuna-2
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆738 · Updated this week
facebookresearch / sapiens2
1K resolution vision transformers pretrained on 1B human images.
☆875 · May 24, 2026 · Updated last month
YanFangCS / GenLIP
Official repo for "Let ViT Speak: Generative Language-Image Pre-training"
☆133 · Jun 10, 2026 · Updated last month
tiiuae / siglino
AMoE: Agglomerative Mixture-of-Experts Vision Foundation Models
☆53 · Jun 11, 2026 · Updated last month
hustvl / SuperCLIP
☆140 · Dec 26, 2025 · Updated 6 months ago
facebookresearch / vjepa2
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,372 · Mar 23, 2026 · Updated 3 months ago
ByteDance-Seed / Depth-Anything-3
Depth Anything 3
☆5,917 · Updated this week
facebookresearch / pixio
[CVPR 2026] Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction
☆457 · Updated this week
davnords / MuM
[CVPR26] MuM's a pretty good feature extractor for 3D tasks, probably the best.
☆120 · Jul 14, 2026 · Updated last week
ByteDance-Seed / SAIL
Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"
☆85 · Oct 29, 2025 · Updated 8 months ago
ClaudiaCuttano / SANSA
[NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."
☆203 · Dec 17, 2025 · Updated 7 months ago
metric-anything / metric-anything
Accepted to ECCV 2026
☆338 · Jul 6, 2026 · Updated 2 weeks ago
UCSC-VLAA / OpenVision
OpenVision (ICCV 2025), OpenVision 2 (CVPR 2026), and OpenVision 3
☆487 · Feb 21, 2026 · Updated 5 months ago
tue-mps / eomt
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
☆613 · Jul 3, 2026 · Updated 2 weeks ago
ga1i13o / JIST
Official repository of the paper "JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition"
☆24 · Dec 15, 2023 · Updated 2 years ago
OpenSenseNova / SenseNova-Vision
Vision as Unified Multimodal Generation
☆430 · Updated this week
facebookresearch / webssl
Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).
☆214 · Mar 20, 2026 · Updated 4 months ago
yyfz / Pi3
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,072 · Jul 3, 2026 · Updated 2 weeks ago
RADSeg-OVSS / RADSeg
[CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…
☆60 · May 31, 2026 · Updated last month
PaulCouairon / JAFAR
[NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution
☆235 · Nov 24, 2025 · Updated 7 months ago
facebookresearch / map-anything
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,569 · Updated this week
facebookresearch / sam3
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…
☆11,016 · Updated this week
vlongle / pixie
Feed-forward model for predicting 3D physics with 3DGS + NeRF
☆297 · Mar 5, 2026 · Updated 4 months ago
ByteDance-Seed / TraceAnything
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆542 · Oct 31, 2025 · Updated 8 months ago
ninaddaithankar / tdv
PyTorch code for the paper "You Don’t Need Strong Assumptions: Visual Representation Learning via Temporal Differences"
☆106 · Jun 18, 2026 · Updated last month
google-deepmind / representations4d
☆180 · Jun 8, 2026 · Updated last month
facebookresearch / boxer
Code for the Boxer research paper
☆598 · Jul 1, 2026 · Updated 2 weeks ago
IDEA-Research / Rex-Omni
[CVPR2026] Detect Anything via Next Point Prediction
☆1,507 · Feb 22, 2026 · Updated 4 months ago
davnords / LoMa
[ECCV 2026] LoMa: Local Feature Matching Revisited
☆360 · Jul 3, 2026 · Updated 2 weeks ago
gangweix / pixel-perfect-depth
[NeurIPS 2025] Pixel-Perfect Depth
☆1,059 · Feb 13, 2026 · Updated 5 months ago