[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
☆541Feb 25, 2026Updated last week
Alternatives and similar repositories for eomt
Users that are interested in eomt are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆367Sep 25, 2025Updated 5 months ago
- Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025☆27Jul 14, 2025Updated 7 months ago
- ☆21Apr 4, 2025Updated 11 months ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆130Mar 10, 2025Updated 11 months ago
- Code for the paper "A Sea of Words: An In-Depth Analysis of Anchors for Text Data", AISTATS 2023☆15Oct 26, 2024Updated last year
- Domain Randomization via Entropy Maximization☆23Apr 18, 2024Updated last year
- Interface to stable-baselines3 APIs for training RL policies on gym-registered environments☆12Jan 24, 2024Updated 2 years ago
- [NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆185Dec 17, 2025Updated 2 months ago
- ☆35Jul 18, 2025Updated 7 months ago
- Official implementation of https://arxiv.org/abs/2106.03496☆15Jul 27, 2022Updated 3 years ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- List of papers wrote by Focoos AI research team!☆12Jun 3, 2025Updated 9 months ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- 🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.…☆347Dec 11, 2025Updated 2 months ago
- ☆27May 31, 2024Updated last year
- Official PyTorch implementation of HCCNet: Efficient Semantic Matching with Hypercolumn Correlation (WACV '24 Oral, Best paper finalist (…☆11Apr 29, 2024Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,181Feb 11, 2026Updated 3 weeks ago
- Official Implementation of NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering☆69Dec 1, 2025Updated 3 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆218Nov 24, 2025Updated 3 months ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆251Jan 13, 2026Updated last month
- World Modeling by Forecasting Vision Foundation Model Features☆35Jan 7, 2026Updated last month
- Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning☆263Sep 24, 2025Updated 5 months ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆40Oct 31, 2024Updated last year
- DROPO: Sim-to-Real Transfer with Offline Domain Randomization☆25Jul 8, 2025Updated 7 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆80Sep 18, 2025Updated 5 months ago
- Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025☆38Oct 4, 2025Updated 5 months ago
- Official repository of the paper "JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition"☆23Dec 15, 2023Updated 2 years ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,682Feb 11, 2026Updated 3 weeks ago
- [ICLR 2025] Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"☆63Jan 23, 2025Updated last year
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- ☆10Nov 18, 2024Updated last year
- [NeurIPS 2024] Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure☆10Nov 27, 2025Updated 3 months ago
- OpenVision (ICCV 2025), OpenVision 2 (CVPR 2026), and OpenVision 3☆457Feb 21, 2026Updated last week
- This repository contains all code and data for the Inside Out Visual Place Recognition task☆23Nov 24, 2021Updated 4 years ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆201Apr 29, 2025Updated 10 months ago
- [DEIMv2] Real Time Object Detection Meets DINOv3☆1,544Jan 7, 2026Updated last month
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆124Oct 23, 2025Updated 4 months ago
- Code for the paper "Attention Meets Post-hoc Interpretability: A Mathematical Perspective", ICML 2024☆21Nov 10, 2025Updated 3 months ago