☆44Jan 14, 2026Updated last month
Alternatives and similar repositories for FastSSL
Users that are interested in FastSSL are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Official Implementation for SimDINO/SimDINOv2☆195Mar 15, 2025Updated 11 months ago
- ☆16Jul 21, 2023Updated 2 years ago
- [ICCV 2025] Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation☆32Sep 18, 2025Updated 5 months ago
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆70May 2, 2025Updated 10 months ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Jul 31, 2025Updated 7 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 7 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆32May 2, 2025Updated 10 months ago
- ☆27Mar 3, 2025Updated 11 months ago
- ☆22Jul 3, 2025Updated 7 months ago
- RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2☆39Aug 29, 2025Updated 6 months ago
- Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."☆32Jul 13, 2024Updated last year
- [NeurIPS 2024] RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation☆28Oct 28, 2025Updated 4 months ago
- ☆28Jul 30, 2024Updated last year
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆32May 12, 2025Updated 9 months ago
- [ICML 2024] Matrix Information Theory for Self-supervised Learning (https://arxiv.org/abs/2305.17326)☆31Sep 21, 2025Updated 5 months ago
- Command-line tool for extracting DINOv3, CLIP, SigLIP2, RADIO, features for images and videos☆60Oct 1, 2025Updated 5 months ago
- ☆28Jul 22, 2024Updated last year
- Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos☆27Jun 24, 2024Updated last year
- Scaling Vision Pre-Training to 4K Resolution☆221Jan 4, 2026Updated last month
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆40Oct 19, 2025Updated 4 months ago
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆33Jan 26, 2026Updated last month
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆131Aug 21, 2024Updated last year
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆29Sep 26, 2024Updated last year
- [TPAMI 2022 & CVPR 2020 Oral] Dynamic Graph Message Passing Networks☆32Sep 21, 2022Updated 3 years ago
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆103Jul 18, 2025Updated 7 months ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated 9 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186May 21, 2025Updated 9 months ago
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Oct 12, 2025Updated 4 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated last year
- Javascript-powered Swype interface☆16Apr 15, 2013Updated 12 years ago
- Code for Scaling Language-Free Visual Representation Learning (WebSSL)☆245Apr 24, 2025Updated 10 months ago
- Cosmos-Transfer1-7B-Sample-AV Toolkits☆46Jun 11, 2025Updated 8 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆63Aug 6, 2025Updated 6 months ago
- ☆38Oct 10, 2024Updated last year
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆337Feb 5, 2024Updated 2 years ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆95May 17, 2024Updated last year
- ☆39Jan 31, 2023Updated 3 years ago
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models☆52Sep 10, 2025Updated 5 months ago